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CHAPTER  I 


INTRODUCTION 

1.  I  Motivation 

Suppose  that  a  process  is  sampled  at  consecutive  points  and  it 
is  known  that  at  least  the  first  observations  satisfy  certain  distribu¬ 
tional  properties,  but  after  some  unknown  point  the  remaining  obser¬ 
vations  may  satisfy  other  distributional  properties.  It  is  of  interest 
to  determine  whether  such  a  change  has  occurred,  and  if  so,  to  esti¬ 
mate  the  point  of  change  and  the  current  properties  of  the  process. 

This  situation  may  arise  in  a  number  of  applications.  For  example, 
suppose  specifications  for  the  manufacturing  of  batteries  allows  for 
no  more  than  a  certain  percentage  of  defectives  to  be  produced;  a 
method  for  determining  whether  the  percentage  has  exceeded  this  value 
is  needed.  If  the  manufacturer  decides  that  product  quality  has  deteri¬ 
orated,  he  may  want  to  estimate  when  the  percentage  of  defectives 
increased  so  that  these  batteries  are  not  sold.  As  another  example, 
consider  the  sales  record  of  wool  fabric;  if  sales  increase  production 
may  need  to  be  stepped  up  to  keep  the  fabric  available, or  if  sales 
decrease  the  production  may  need  to  be  slowed  down.  In  this  instance, 
not  only  is  the  estimate  of  when  a  change  in  sales  occurred  of  interest, 
but  also  an  estimate  of  the  current  demand  for  the  product.  A  third 
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example  deals  with  the  medical  experimentation  of  a  new  drug  for 
treatment  of  pain.  If  the  data  indicate  that  the  drug  is  effective,  the 
researcher  may  want  to  estimate  the  reaction  time  of  the  drug  as  well 
as  a  measurement  of  the  patient's  improvement  for  the  dosage  pre¬ 
scribed, 

1.2  The  Statement  of  the  Problem 

The  following  specific  case  is  investigated  to  study  processes 
for  which  a  change  in  the  model  is  suspected.  Let  X  ,  •  •  • ,  be 
independent  Bernoulli  random  variables  taking  on  the  values  1  or  -1 
with  P(X.=1)=1-  P(X.=-l)=p  for  i  =  l,...,p  and  P(X.  =  1)  = 

1  -  P(X.  =  -1)  =  q  for  i  =  p+1,  •  •  • ,  n,  where  p  is  an  unknown  param¬ 
eter,  1  <  p  <  n,  and  p  and  q  are  distinct  Bernoulli  parameters, 
where  p  is  assumed  known  and  q  may  be  unknown.  If  p  <  n  we 
say  that  a  change  in  the  model  has  occurred,  p  is  called  the  change 
point,  that  is,  at  step  p  +  1  the  model  changes. 

We  are  interested  in  testing  whether  a  change  in  the  model  has 
occurred.  The  hypotheses  considered  are 

Hq:  p=n,  p  vs.  :  p<n,  p,  q,  p^q  for  the  two-sided  test 
1.  2.  1  with  p^q, 

H^:  p=n,  p  vs.  Hj :  p<n,  p,  q,  p>q  for  the  one-sided  test 
with  p>q, 

and  H^:  p=n,  p  vs.  :  p<n,  p,  q,  p<q  for  the  one-sided  test 


with  p<q 


We  are  furthermore  interested  in  the  situation  where  n  is  small, 
that  is,  n<20.  The  aim  is  to  obtain  a  test  for  small  n  which  is  easy 
to  use  and  which  will  detect  a  change  quickly. 


The  statistics  we  propose  for  testing  for  a  change  in  the  model 


are 


r  .  n 

S  =  S  (X,,  ....  X  )  =  sup  I  -  Z  X. —  E 

n  n'  1'  n'  |  r  .  ,  i  n-r  . 

r  i=l  i=r+l 


X. 


1. 2.  2 


for  Hq  vs.  H:  , 


4-  +  l  ^  t  n 

S  =  S  (X,,  .. X  )  =  sup  {-  Z  X.--±-  Z  X.  } 
n  n'  1’  *  n  r  r  •  ,  i  n-r  .  ,,  i 

r  i-l  i=r+l 


for  vs. 

1  r  1  n 

and  S  =  S  (X, ,  . . . ,  X  )  =  inf  {i-  Z  X.  -  -i-  Z  X.  } 

n  n  1’  n'  „  ,  i  n-r  .  _L,  i' 

r  i=l  i=r+l 


for  Hq  vs.  Hj  , 

1  r  1  n 

where  —  Z  X.  and  -  Z  X.  are  taken  to  be  zero  when  r  =  0 

r .  .  i  n-r  , ,  i 

i=l  i=r+l 

and  r  =  n,  respectively.  One  intuitively  expects  and  S+  to  be 

"large"  and  to  be  "small"  when  H^,  h|  and  H~,  respectively, 

are  true.  Note  that  [o,  2],  S+e  [o,  2j  and  S~  e  [-2,  o].  Clearly 

+  i  n  l  n 

S  >0;  S  >0  since  0<sup{-  Z  X.,  -  -  Z  X.  } 

n~  n-  ~  n  .  ,  l  n  .  ,  i 

i=l  i=l 

1  r  1  n  1  r  1  n 
=  sup  {i  Z  X.-—  Z  X,  }<sup{—  Z  X.-—  Z  X.}; 

A  r  j  ,  i  n-r  .  i  r  .  ,  i  n-r  ,  i " 

r=0,  n  i=l  i=r+l  r  i=l  i=r+l 


jh 

Ik 
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+  -  |  ]  r 

similarly  S<0.  S  <  2,  S  <2  and  S  >-2  since  -  Z  X  <1 
n—  n~  n—  I  r  .  ,  i  — 

i=l 

|  l  “  | 

and  I  n '  r  ^  ^  ^  for  aR  r  •  Hence  by  "large"  we  mean 

i=r+l 

"near  2"  and  by  "small"  we  mean  "near  -2". 

Define  by  R^,  R^  and  R^  the  random  variables  corresponding 

to  the  points  where  the  values  of  S  ,  S+  and  S~  ,  respectively,  are 

n  n  n  7 

attained.  For  obv'ius  reasons  we  will  refer  to  R  ,  R+  and  R~  as 

n  n  n 

the  change  point  random  variables.  To  insure  that  these  are  bonafide 
random  variables,  they  must  be  defined  uniquely.  Suppose  that  for  the 

Sn(X1,...,Xii,=  .up  |  I  EX,J_  Z  X  | 
r  i=l  i=r+l 

occurs  at  j  =  jj,  .  Denote  by  the  mapping  J^X^,  •••fX  ) 

=  Randomize  by  defining  the  function  U  so  that 

P  (U  o  Jn(Xj,  ....  Xfl)  =  ji)  =  I,  i  =  1,  ...,  i  .  Define  Rn(Xj,  •••.X) 

sU°J  (X  ,..,X).  Similarly  randomize  to  determine  R+  and  R~ 

1  n  n  n 

uniquely.  Fortunately  for  most  sequences,  the  test  statistics  occur  at 
a  unique  point;  also  for  these  sequences  for  which  the  supremum  does 
not  occur  uniquely,  l  is  rarely  larger  than  2. 

The  change  point  random  variables  are  of  interest  for  the  follow¬ 
ing  reason.  When  the  alternative  hypothesis  is  true,  we  expect  that 
the  point  r  at  which  the  statistic  is  attained  should  be  "close"  to  the 
change  point  p  .  We  will  in  fact  estimate  p  to  be  this  point  in  the  data 
sequence;  denote  the  estimator  by  p  .  (Nonuniqueness  difficulties  will 
be  resolved  by  randomizing.  )  When  q  is  unknown,  we  use  the  last 
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2  n  X.+l 

n-p  observations  to  estimate  it;  q  =  ~T  ^  5 - )  *s  an 

n-p  •  A  . ,  2 

r  i=p+l 

obvious  estimator  to  consider. 

1.3  Extensions 

The  statistics  we  have  proposed  for  testing  for  a  change  in  the 
model  will  be  studied  only  for  the  case  where  the  Xl^s  are  1  or  -1 
Bernoulli  random  variables,  but  the  test  can  be  used  for  more  general 
situations  as  shown  below. 

Note  that  the  test  statistics  are  invariant  under  scale  transfor¬ 
mations  and  independent  of  location  transformations.  S  is  invariant 

n 

for  the  scale  transformation  and  independent  of  the  location  transfor¬ 
mation  a  X.  +  b  where  a^O;  S+  and  S"  are  invariant  for  the  scale 
i  n  n 

transformation  and  independent  of  the  location  transformation  a  X.^  +b 

where  a>0  .  Thus  if  Yj,  are  independent  Bernoulli  random 

variables  taking  on  the  values  c  and  d  with  P(Y  =c)  =  1  -  P(Y  =d) 

i  i 

=  p  for  i  =  1,  •  •  • ,  p  and  P(Y.=c)  =  1  -  P(Y.=d)  =  q  for  i  =  p+l,  ••*,n 

we  can  test  whether  or  not  a  change  in  the  model  has  occurred;  in 
particular  for  the  two-sided  test  with  p  ^  q,  choose  a^O  and  b  such 
that  Y.  =a  X, +b,  i=l,*»»,n,  where  P  (X^=l)  =  1  -  P(X.=-1)  =  p  for 

i  =  1,  •  •  •,  p  and  P<X,  =  1 )  =  1  -  P(X^=-1)=  q  for  i  =  p+1,  •  •  • ,  n.  We 

have 

1  r  l  n 

S  (Y  ,  •  •  • ,  Y  )  =  sup  I  —  E  Y.  -  —  E  Y. 
n  l  n'  „  I  r  .  ,  i  n-r  .  i 

r  i=l  i=r+l 

11  r  1  n  I 

=  sup  i  E  (a  X.  +b)  -  —  E  (aX. +b) 
i r  .  ,  i  n-r  .  . ,  i  I 

r  i=l  i=r+l 


! 
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a  1 

sup  i  —  £  X.  +  —  •  rb 

'  r  j  i  1  r 

r  i=l 


=  |  a  |  sup  |  i  £  X.  - 
r  i=l 


a  | 


a 

n-r 


n 


£ 

i=r+l 


X.  - 


1 

n-r 


n 

£ 

i=r+l 


-i-  (n-r)b 
n-r 


For  given  p  and  <x ,  reject  the  null  hypothesis  for  Y  ,  •  •  • ,  Y^  that 

p  =  n  when  Yj,  • « • ,  Y^)  >  |  a  |  s^  if  and  only  if  S^(Xj,  •  •  • ,  X^)  >. 

s  where  «x  >  p  [s„ (x, .  *  *  *  •  x  )  ^  ®  |p  ]  • 

For  the  more  general  problem  where  Y^-v-Ffy;©)  for  i  =  1,  •  •  • ,  p, 

Y.<^F(y;  0' )  for  i  =  p+1,  •  •  • ,  n,  and  0  is  a  location  parameter  we  can 

use  our  setup  to  construct  a  nonparametric  test  for  detecting  a  change 

2 

in  the  model.  For  example,  suppose  Y.~N(pj,  <r  )  for  i  =  l,***,p, 

2  2 
Yi~N(H2.  <r  )  for  i  =  p+1,  •  •  *,  n  and  p^p2  where  pj  and  cr  are 

known  and  p2  may  be  unknown.  To  test  Hg:p=n,  Pj  vs.  H*:  p<n, 

P2,  p2,  p1>p2.  transform  Y.  by  the  rule 


X. 

i 


i£  Yi^l 

if  V  ^ 


Using  the  statistic  ,  test 


H^:  p  =  n,  p  vs.  Hj  :  p<n,  p,  q,  p>q 
00 


with  p  =  .  5  and  q  = 


f 


-(y-p  )2/2o-2 

e  dy.  If  H1  is 
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rejected,  then  reject 
and  estimate  ^  by 


If  Hq  is  rejected, 
n 

2  Y.  . 
i=£+l  1 


estimate  p 


as  above 


1.4  Literature  Review 

The  one-sided  and  two-sided  tests  for  detecting  a  change  point 
proposed  above  are  intuitively  appealing,  but  as  one  might  suspect 
the  properties  of  these  tests  are  extremely  tedious  to  evaluate.  Un¬ 
fortunately,  it  seems  as  though  any  test  statistic  one  might  consider 
as  a  reasonable  candidate  for  testing  for  a  change  point  suffers  from 
the  same  drawback. 

Consider  for  example  the  generalized  likelihood  ratio  test.  The 

likelihood  function  for  X, ,  •  •  • ,  X  is 

1  n 

1+x.  1-x. 

i  i 


n  p  { 1  -p)  if  p  =  n  , 

i=l 

1+x.  1-x.  1+x.  1-x. 

i  i 

p  — o —  — 7 —  n  — 5 —  — 5 — 

n  p  (1-p)  n  q  (1-q)  if  pc  {1,  . . .,  n-l> 

i=l  i=p  +  l 

1+x.  1-x. 

i  _ i_ 

and  n  q  2  (1-q)  2  if  p  =  0  ; 

i=l 


the  generalized  likelihood  ratio  when  p  and  q  are  known  is 
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n  p  d-p) 

i=l 


sup  <  n  p  2  (i-p)  2  n 

r  I  i=l  i=r+l 


2  2 

n  q  (1-q) 


inf  <  n 
r  i=r+l 


n  (f)  2  <f*>  2 

-x,  q  >-q 


C  «  =  cn  „  •*  *  .  XJ  =  2  lo8  X 

p»  q  p.  q  i  n 


n 

1.4.1  =  inf  {(n-r)  log  (E  •  r-^E)  +  (  X  x  )  log  (E  .  ) } 

q  i-q  q  i-p 


i=r+l 


q  1-p 


(When  r  =  n,  set  the  term  in  the  brackets  in  the  last  equation  equal  to 
0. )  Both  p.  and  q  are  assumed  known  when  performing  the  generalized 
likelihood  ratio  test  H^:  p=n,  p  vs.  :  p<n,  p,  q.  Evaluation  of 
critical  regions  for  this  test  statistic  is  considerably  more  compli¬ 
cated  than  for  the  test  statistics  we  have  proposed;  note  that 

n  , 

X  X.  f  {n-r,  n-r-2,  •  •  • ,  -(n-r-2),  -{n-r)  },  log  (E  .  >  0  if  and 

isr+1  1  q  1-P 

only  if  p>q  and  log  (E  .  E)  >  o  if  and  only  if  |  p-j  j  <  |  q-i- 1  . 


Furthermore,  tabulation  of  critical  regions  for  the  generalized  likeli¬ 
hood  ratio  test  for  given  value  of  n  would  take  up  a  great  deal  of  space 
even  for  a  moderate  number  of  values  of  p  and  q;  the  critical 
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regions  for  our  test  statistics  depend  only  on  p  and  hence  can  be 
written  down  more  concisely. 

Since  the  generalized  likelihood  ratio  method  often  yields  tests 
with  good  properties,  it  is  conceivable  that  the  effort  required  to 
evaluate  critical  regions  may  be  worthwhile;  we  examine  the  test  for 
n  =  5  .  Since  we  are  assuming  both  probabilities  p  and  q  are 
known,  only  the  one-sided  test  is  of  interest.  Furthermore,  since 
Cp  q(xl’  *  *  *  *  xn)  =  ci_p  x  (  -xi>  •  •  -xn),  applying  symmetry  argu¬ 

ments  to  the  one-sided  test  with  p<q  will  provide  properties  of  the 
test  for  p>q  .  Thus,  we  will  consider  the  likelihood  ratio  test  for 
p<q  only.  Table  1  lists  the  critical  regions  for  the  test  with  p  =  .  1 
andq  =  .3,  .9  and  p  =  .  5  and  q  =  .  7,  .9  .  Included  in  the  listing  for 
each  p  and  q  are  the  values  c  of  the  statistic  in  increasing 
order,  the  probability  of  the  statistic  being  at  least  as  small  as  c  , 
the  sequences  yielding  that  value  c  ,  the  point  r  in  each  sequence 
where  c  is  attained,  and  the  number  of  -l's  in  each  sequence. 
(The  point  r  where  c  is  attained  is  usually  uniquely  defined; 
exceptions  may  occur  when  p  =  1  -q, )  Note  the  following  properties 
of  the  test  evident  from  the  equation  for  c  and  suggested  by  the 

p.  q 

behavior  of  the  test  from  Table  1. 

For  every  p<q  and  for  all  n,  c  <  0  ;  c  =0  when 

Pi  q  ~  p»  q 

X.  =  -IV  i  ;  c  attains  its  smallest  value  for  the  sequence  where 

i  p,  q 

x^  *  IV  i  and  the  infinum  occurs  at  r  =  0  .  For  any  p<q  those 


J 
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Table  1,  The  Generalized  Likelihood  Ratio  Teat  Statistic,  nr 5. 


c 

P(C.1,.3-C)  X1X2X3X4X5 

r 

#-l's  in 

p  =  .  1,  q  = 

.  3 

-4,7715 

.00001 

11111 

0 

0 

-3.8172 

.00010 

-11111 

1 

1 

-3.5989 

.00046 

1111-1 

0 

1 

111  -11 

0 

1 

11-111 

0 

1 

1-1111 

0 

1 

-2.8629 

.00127 

-i  -mi 

2 

2 

-2.6446 

.00370 

-1111-1 

1 

2 

-111-11 

1 

2 

-11-111 

1 

2 

-2.4263 

.00856 

111-1  -1 

0 

2 

11-11-1 

0 

2 

1-111-1 

0 

2 

1-11-11 

0 

2 

11-1-11 

0 

2 

1-1-111 

0 

2 

-1.9086 

.01585 

-1  -1-111 

3 

3 

-1.6903 

.03043 

-1  -111-1 

2 

3 

-1  -11-11 

2 

3 

-1.472 

.05230 

-111-1  -1 

1 

3 

-11-11-1 

1 

3 

-11-1-11 

1 

3 

-1.2237 

.08146 

11  -1  -1-1 

0 

3 

1-11-1  -1 

0 

3 

1-1  -11-1 

0 

3 

1-1-1-11 

0 

3 

-0.9543 

. 14707 

-1-1-1-11 

4 

4 

-0.7360 

.21268 

-1  -1  -11-1 

3 

4 

-0.5172 

. 27829 

-1  -11-1  -1 

2 

4 

-0.2994 

.  34390 

-11-1-1-1 

1 

4 

•0.0811 

.40951 

1  -1  -1  -1  -1 

0 

4 

0 

1.00000 

-1  -1  -1-1-1 

5 

5 

5 


5 


-<fV  ! 
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Table  1  (Continued). 


P(C.1..9£Cl  x]'  * '  *’  x5 


p  =  l.  q  =  .9 
-9.5425  .00001 

-7.9400  .00010 

-5.7255  .00127 


-3.8170  .04096 


-1.9085  .13978 


11111 

0 

-11111 

i 

1111-1 

0 

111-11 

0 

11-111 

0 

1-1111 

0 

-i  -mi 

2 

-lin-i 

1 

-in-11 

1 

l-i-in 

3 

-11-in 

1.3 

-l  -l  -in 

3 

ln-i-i 

0 

li-ii-i 

0 

l-in-i 

0,2 

n-i-ii 

0,4 

l-ii-ii 

0,2 

l-i-i-n 

4 

-li-i  -li 

4 

-l-in-i 

2 

-l  -n-ii 

2,4 

-l-i-i-n 

4 

11-1  -l  -i 

5 

i  -n  -l  -l 

5 

-ui-i  -l 

1,5 

l-i-n-i 

3,5 

-li-ii-i 

1,3 

i  -l  -l  -i  -i 

5 

-li-i  -l  -l 

5 

-l  -li-i  -l 

5  . 

-l  -l  -li-i 

3,5 

-l  -l  -l  -l  -l 

5 

0 


1.00000 


Table  1  (Continued). 


c 

p(C  _<c) 

•  #  ( 

xr  ****  x5 

r 

#  -1'  s  in  x  ».,x  1 

P  *  •  5,  q 

=  .7 

-1.4615 

.03125 

mil 

0 

3 

1  i 

i 

-1.1692 

.06250 

-mil 

1 

i  ! 

-0.8769 

. 12500 

l-ini 

2 

1  : 

-l-uii 

2 

2  ! 

-0.7255 

.21875 

liu-i 

0 

1 

in-11 

0 

1 

11-in 

0 

1 

-0. 5846 

. 31250 

i  -l  -in 

3 

2 

-11-in 

3 

2 

-l-i-iii 

3 

3 

-0.4332 

. 37500 

-lin-i 

1 

2 

-in-11 

1 

2 

-0.2923 

. 50000 

li-i-ii 

4 

2 

l-ii-u 

4 

2 

l-i-i-n 

4 

3 

-li-i-ii 

4 

3 

-0.1409 

.62500 

l-in-i 

2 

2 

-l  -ni-i 

2 

3 

-l-ii-u 

2 

3 

i 

i 

H 

1 

1— » 

1 

1— • 

2 

4 

0 

1.00000 

111-1-1 

5 

2 

11-11-1 

5 

2 

11  -1  -1  -1 

5 

3 

1-11-1-1 

5 

3 

-111-1-1 

5 

3 

1  -1-11-1 

5 

3 

-11-11-1 

5 

3 

1  -1  -1  -1  -1 

5 

4 

-11-1-1-1 

5 

4 

-1-11-1-1 

5 

4 

-1  -1  -11-1 

5 

4 

-1-1 -1-1-1 

5 

5 

j 

< 

t. 

! 

j 

....  _ J 
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Table  1  (Continued). 


c 

P,C.5..^Cl 

X1  •*'X5 

r 

#  -1 ' a  in 

p  =  .  5.  q 

=  .9 

-2.  5525 

.03125 

11111 

0 

0 

-2.0420 

.06250 

-11111 

1 

1 

-1.5315 

.  12500 

i  -mi 

2 

1 

-l-ini 

2 

2 

-1.0210 

.25000 

li-m 

3 

1 

i  -l  -in 

3 

2 

-li-m 

3 

2 

-l  -l-in 

3 

3 

-0.6441 

. 31250 

mi-1 

0 

1 

in-11 

1 

-0. 5105 

. 53125 

li-i-n 

4 

2 

l-ii-ii 

4 

2 

-in-11 

4 

2 

l-i-i-n 

4 

3 

-li-i-n 

4 

3 

-l-ii-ii 

4 

3 

-l-i-i-n 

4 

4 

-0.1336 

. 56250 

-mi-1 

1 

2 

0 

1 . 00000 

ni-i-i 

5 

2 

li-ii-i 

5 

2 

i-m-i 

5 

2 

n-i-i-i 

5 

3 

l-n-i-i 

5 

3 

-ln-i  -l 

5 

3 

i  -l  -n-i 

5 

3 

-li-ii-i 

5 

3 

-l  -m-i 

5 

3 

i  -l  -l  -l  -l 

5 

4 

-li-i-i-i 

5 

4 

-l  -li-i  -l 

5 

4 

-l-i-ii-i 

5 

4 

-l  -l  -l  -l  -l 

5 

5 
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sequences  with  fewer  -l's  tend  to  give  smaller  values  of  c^  and 

those  sequences  with  more  -l's  tend  to  give  larger  values  of  c^ 

Also,  those  sequences  with  fewer  -l's  tend  to  have  c  occurring 

Pi  1 

at  values  of  r  near  0  and  those  sequences  with  more  -l's  tend 
to  have  c  occurring  at  values  of  r  near  the  end  of  the  sequence, 

p.  q 

This  is  somewhat  alarming  since  when  is  rejected,  an  obvious 

estimate  of  the  change  point  is  that  point  at  which  c^  ^  is  attained. 

In  this  case,  estimates  of  change  points  would  generally  occur  at 

the  beginning  of  the  sequence  and  detection  of  changes  late  in  the 

sequence  would  be  highly  unlikely  or  impossible  for  any  reasonable 

size-c*  test  for  many  values  of  p  and  q  .  A  closer  examination  of 

this  dilemma  will  be  made  in  later  chapters.  A  comparison  of  the 

power  of  the  test  based  on  C  and  S  will  be  made  in  Chapter  3. 

p,  q  n 

The  generalized  likelihood  ratio  test  statistic  C  has  not 

p,  q 

been  studied  for  finite  sample  sizes  in  the  literature  nor  have  tables 
been  made  available  for  its  use  in  applications. 

Asymptotic  results  using  likelihood  ratio  test  statistics  for  a 
change  point  problem  for  independent  Bernoulli  random  variables  have 
been  obtained  by  D.  Hinkley  and  E.  Hinkley  (1970),  but  their  setup  of 
the  problem  is  somewhat  different  than  ours.  Their  two-sided  test 
hypothesis  is  H^:  p  =  p^  versus  :  p  t  p^  where  p^<n  and  their 
one-sided  alternatives  are  H'^ :  P<Pq  and  H":  p>Pq  where  p^Cn  . 
They  assume  that  a  change  actually  took  place  and  are  testing  whether 
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the  change  took  place  at  p^;  on  the  other  hand  we  are  testing 
whether  any  change  has  taken  place,  that  is,  if  p  =n.  An  iterative 
solution  for  the  asymptotic  distribution  of  their  likelihood  ratio  test 
statistic  is  obtained  using  random  walk  results.  For  practical  pur¬ 
poses,  the  solution  is  of  limited  use  for  two  reasons.  Firstly,  a 
great  deal  of  computation  is  required  for  finding  critical  regions  for 
the  tests  by  using  the  iterative  formula.  Secondly,  the  sample  size  n 
and  the  difference  n  -p^  must  be  quite  large  for  the  asymptotic 
formulas  to  be  accurate;  when  p  and  q  are  unknown,  n  and  n-p^ 
must  be  even  larger  in  order  to  first  estimate  p  and  q  ,  and  then 
use  the  formula. 

Possibly  the  most  well-known  and  widely  used  statistic  for 
testing  for  a  change  point  is  one  proposed  by  E.  S.  Page  (1955).  His 
test  statistic  for  the  one-sided  test  with  p<q  is 

1 . 4. 2  M  =  max  [s  -  min  S.  ] 

0<r<n  r  0<i<  r  1 

r 

where  =  2  (X^  -(2p-l)),  r  =  1,  •  •  • ,  n  and  =  0  .  (The  mean 

i  =  1 

of  X^  under  the  null  hypothesis  is  2p-l;  for  other  distributional 

setups,  the  same  statistic  M  is  used  but  S  is  replaced  by 
r 

2  (X^  -  0)  where  0=  E(X.  |H^).)  The  null  hypothesis  for  a  size  - 

i=l 

test  is  rejected  if  M>h  where  h,  is  such  that  P(M>h  p  )<o<. 

Visually  this  amounts  to  graphing  Sr  vs.  r  and  rejecting 
wherever  an  rises  a  distance  h^  above  the  previously  plotted 

J 
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minimum  value  on  the  graph.  Intuitively  one  expects  the  graph  to 

rise  when  an  increase  in  the  mean  occurs  and  to  remain  horizontal 

when  there  is  no  change.  When  M  occurs  at  a  unique  point  r^, 

then  the  estimator  p  of  p  is  taken  to  be  the  most  recent  value  where 

min  S.  occurs;  when  M  does  not  occur  at  a  unique  point.  Page 
0^Cr0  1 

has  not  indicated  a  unique  choice  for  the  estimator  p  . 

Page  has  computed  some  critical  regions  for  the  test  for  the  par¬ 
ticular  case  where  the  X.  are  symmetrical  Bernoulli  random  variables 
and  has  computed  the  power  of  the  test  for  one  value  of  n.  He  has 
made  no  study  of  the  types  of  sequences  in  the  critical  region  for  cer¬ 
tain  sizes  of  tests  nor  has  he  studied  the  properties  of  the  estimator 
p  of  p  .  We  consider  the  latter  two  properties  crucial  in  evaluating 
change  point  tests.  The  case  where  the  X^  are  asymmetrical  is 
also  of  interest. 

To  study  these  properties  let  us  examine  the  test  for  n  =  5. 

Evaluation  of  critical  regions  requires  knowledge  of  p.  These  critical 

regions  are  more  tedious  to  evaluate  than  for  the  test  S"  ,  For 

n 

n  =  5  and  p  -  .  1,  .  5,  .  9,  Table  2  lists  the  values  m  of  the  statistic, 
the  probability  that  M  is  at  least  as  large  as  m,  the  sequences  yield¬ 
ing  that  value  m,  the  point  r  in  the  sequence  where  m  is  attained, 
and  the  number  of  -l's  in  each  sequence.  Note  the  following  proper¬ 
ties  of  the  test  evident  from  the  form  of  the  test  statistic  and  suggested 
by  the  behavior  of  the  test  from  Table  2. 


Table  2.  Page's  Statistic,  n=5 


f 

1  m 

P(M>m) 

x  .  •  •  »  x 

1*  *  5 

r 

#  -l's  in  x  x 

1  5 

[  p  *  -1 

L  9.0 

.00001 

11111 

0 

0 

;  7.2 

.00019 

-11111 

1 

1 

1111-1 

0 

1 

*  7.0 

. 00046 

i  -mi 

0 

1 

11-in 

0 

1 

in-11 

0 

1 

I  5.4 

.00289 

-i  -ini 

2 

2 

1 

-lin-i 

1 

2 

ni-i  -i 

0 

2 

!  5.2 

.00613 

n-n-i 

0 

2 

] 

i  -ni-i 

0 

2 

J 

-in-11 

1 

2 

-li-ni 

1 

2  j 

5.0 

.00856 

li-i-n 

0 

2 

l-n-ii 

0 

2 

l  -l-ni 

0 

2 

3.6 

.03772 

n-i-i-i 

0 

3 

-ni-i-i 

1 

3 

-l  -ni-i 

2 

3 

-l  -l  -in 

3 

3 

3.4 

.05959 

l-i-n-i 

0 

< 

3 

-n-i  -n 

1 

3 

-l  -li-n 

2 

3 

3.2 

.07417 

l-i-n-i 

0 

3 

-n-i  -n 

1 

3 

3.0 

.08146 

i  -l  -l  -n 

0 

3 

1.8 

.40951 

1-1 -l-i-i 

0 

4 

-n-i-i-i 

1 

4  ; 

-l  -n-i -l 

2 

4 

-l  -l  -n-i 

3 

4  j 

-l-i-i-n 

4 

4 

0.0 

1.00000 

-l-i-i-i-i 

0 

r 

5  ; 
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Table  2  (Continued). 


m 

P(M>m) 

X1’*“’X5 

r  #  -1 

1  s  in  x, 

P  =  -3 

7.0 

.00243 

11111 

0 

0 

5.6 

.01377 

-11111 

1 

1 

1111-1 

0 

1 

5.0 

.03078 

1-1111 

0 

1 

11-111 

0 

1 

111-11 

0 

1 

4.2 

.07047 

111-1-1 

0 

2 

-1111-1 

1 

2 

-1-1111 

2 

2 

3.6 

. 12339 

11-11  -1 

0 

2 

1-111-1 

0 

2 

-111-11 

1 

2 

-11-111 

1 

2 

3.0 

.  16308 

11-1-11 

0 

2 

1  -11-11 

0 

2 

1-1-111 

0 

2 

2.8 

.28656 

-1-1-111 

3 

3 

-1-111-1 

2 

3 

-111-1-1 

1 

3 

11-1-1-1 

0 

3 

2.2 

. 37917 

1-11-1  -1 

0 

3 

-11-11-1 

1 

3 

-1-11-11 

2 

3 

1.6 

.44091 

1  -1-11-1 

0 

3 

-11-1  -11 

1 

3 

1.4 

.83193 

1-1-1-11 

0,4 

3 

1-1-1 -1-1 

0 

4 

-11-1  -1  -1 

1 

4 

-1  -11-1  -1 

2 

4 

-1  -1  -11-1 

3 

4 

-1  -1  -1  -11 

4 

4 

0.0 


1.00000 


-1-1 -1-1-1 


0 


5 
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Table  2  (Continued). 


m  P(M>m)  xr*‘*,x5 


r  #  -l'a  in  x  x 
X  D 


P  =  .5 


5.0 

.03125 

11111 

0 

0 

4.0 

.09375 

1111-1 

0 

1 

-11111 

1 

1 

3.0 

.25000 

111-11 

0 

1 

11-111 

0 

1 

1-1111 

2 

1 

111-1-1 

0 

2 

-1-1111 

2 

2 

2.0 

.  59375 

11-11-1 

0 

2 

1  -111-1 

2 

2 

-1111-1 

1 

2 

11-1-11 

0 

2 

-111-11 

1 

2 

1-1-111 

3 

2 

-11-111 

3 

.  2 

11-1-1-1 

0 

3 

-111-1-1 

1 

3 

-1  -111-1 

2 

3 

-1-1-111 

3 

3 

1.0 

.96875 

1-11-11 

0,2,4 

2 

1-11-1  -1 

0,2 

3 

1-1-11-1 

0,3 

3 

-11-11-1 

1,3 

3 

1  -1 -1  -11 

0,4 

3 

-11-1-11 

1.4 

3 

-1  -11-11 

2,4 

3 

1-1 -1-1-1 

0 

4 

-11-1-1-1 

1 

4 

-1-11-1-1 

2 

4 

-1-1-11-1 

3 

4 

-1-1-1-11 

4 

4 

0.0 

1.00000 

-1  -1-1-1  -1 

0 

5 

7 
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For  any  p  and  n,  0  <  max  [s  -  min  S.]  <  2n(l-p); 

0<  r<n  r  0<  i<  r 

the  sequence  of  n  -l's  has  M  =  0  and  the  sequence  of  n  l's  has 
M  =  2n(l-p).  For  any  given  value  of  p  ,  those  sequences  with  fewer 
-l’s  tend  to  give  values  of  M  near  2n(l-p)  and  those  sequences 
with  more  -l's  tend  to  give  values  of  M  near  0.  In  addition,  for 
any  given  value  of  p,  those  sequences  with  fewer  -l's  tend  to  have 
M  attained  at  values  of  r  near  0  and  those  sequences  with  more 
-l's  tend  to  have  M  attained  at  values  of  r  near  n.  The  unfor¬ 
tunate  consequence  is  that  estimates  of  change  points  will  generally 
occur  at  the  beginning  of  the  sequence,  making  detection  of  later 
change  points  difficult.  This  same  predicament  was  noted  in  connec¬ 
tion  with  the  generalized  likelihood  ratio  statistic.  The  distribution 
of  p  for  Page's  statistic  for  n  =  5  will  be  studied  in  later  chapters. 
The  power  of  this  test  will  be  calculated  in  Chapter  3. 

A  disturbing  property  of  Page's  test  is  that  X  ,  •  •  • ,  and 

X  ,  •  ••,  X  give  the  same  value  of  M  .  To  see  this,  let 
n  X 

S  (X.  , • • • ,  X.  )  =  2  (X.-(2p-l))  ;  for  any  i<r  we  have 

r  ll  ln  1 

S,«Xr  •  •  •  ■  X„>  -  Si'V  •  •  ••  Xn>  =  {  S„(Xr  •  •  ■  •  xn»  -  Si(Xl-  •  •  •  •  xn” 

.  (S )  -  S  (X  ...,X  ))=  S  ,<X „.•••, x  >  - 
n  1  n  r  i  n  n-i  n  i 

S  (X  ,  •••,  X.)  ;  the  result  now  follows  trivially.  It  is  appropri- 
n-r  n  1 

ate  for  a  two-sided  test  for  a  change  point  to  behave  in  such  a  manner, 


V 
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but  for  the  one-sided  test  one  is  interested  in  testing  for  a  change  in 
only  one  direction.  It  seems  as  though  Page's  statistic  is  actually 
testing  something  other  than  what  is  proposed. 

1.5  Summary 

Nonparametric  one-  and  two-sided  testa  are  proposed  for  detect¬ 
ing  a  change  in  the  location  parameter  for  a  sequence  of  independent 
random  variables.  The  aim  is  to  obtain  a  test  for  small  n  which  is 
easy  to  use  and  which  will  detect  a  change  quickly. 

As  is  characteristic  of  other  change  point  tests,  properties  of 
the  proposed  tests  are  extremely  tedious  to  evaluate.  The  method 
employed  for  deriving  the  distributions  of  the  test  statistics  and  change 
point  random  variables  under  the  null  and  alternative  hypotheses  is  to 
first  obtain  expressions  for  their  joint  distributions.  Efficient  pro¬ 
cedures  facilitating  this  task  are  presented.  Unfortunately  attempts 
for  finding  recursive  formulas  to  study  these  distributions  have  been 
futile;  large  sample  derivations  are  presented  for  the  +  1  largest 
values  of  the  statistics,  but  for  large  n  they  are  useful  only  when  the 
size  of  the  test  is  small. 

Properties  of  the  distributions  of  the  test  statistics  under  the 
null  hypothesis  are  studied  and  are  found  to  depend  heavily  on  p  ; 
critical  regions  for  the  one  -  and  two-sided  testing  situations  are  tabu¬ 
lated.  Techniques  for  testing  for  a  change  point  when  n  >  20  are 


discussed, 
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Analyses  of  the  change  point  random  variables  under  the  null 
hypothesis  reveal  the  advantage  of  the  proposed  test  statistics: 
whenever  the  null  hypothesis  is  rejected  each  pe  {l,***,n-l}  has 
positive  probability  of  being  estimated.  Hence  change  points  can  be 
detected  quickly.  The  generalized  likelihood  ratio  test  (1.4.  1)  and 
Page's  test  (1.4.  2}  both  lack  in  this  aspect:  for  each  of  these  tests 
detection  of  changes  later  in  the  sequence  is  generally  difficult  or 


impossible. 


CHAPTER  2 


NULL  HYPOTHESIS  DISTRIBUTIONS 

The  distributions  of  the  test  statistics  (1.  2.  2)  and  the  corre¬ 
sponding  change  point  random  variables  when  the  null  hypothesis  is 
true  must  be  derived  to  find  critical  regions  for  the  tests  and  to  study 
properties  of  the  estimator  of  the  change  point  p  .  Obtaining  any  of 
these  distributions  is  nontrivial,  as  one  might  suspect:  the  sample 
space  of  sequences  Xj,»»»,Xn  has  2^  elements!  Fortunately,  one 
need  not  generate  all  these  sequences  and  then  evaluate  the  statistics 
for  each  sequence  to  find  the  distributions;  a  more  efficient  procedure 
will  be  presented.  The  strategy  will  be  to  first  obtain  expressions  for 

the  joint  distributions  (S  ,  R  )  ,  (S+  ,  R+  )  ,  and  (S  ,  R  )  ,  from 

n  n  n  n  n  n 

which  the  marginal  distributions  are  then  derived  and  studied. 

In  the  first  section  relevant  sample  spaces  are  defined  and  tools 

developed  for  employing  the  procedure;  the  counting  procedure  for 

finding  the  distributions  is  then  presented.  The  marginal  distributions 

S+  and  S  are  studied  in  Sections  3  and  4.  In  the  next  two  sections 
n  n 

the  covariance  between  the  test  statistics  and  change  point  random 
variables  is  analyzed  for  the  one-  and  two-sided  testing  situations. 

The  change  point  random  variables  R^  and  R*  are  investigated  in 
Sections  7  and  8.  Large  sample  derivations  are  presented  in 
Section  9.  The  chapter  is  concluded  with  an  example. 
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2.1  Definitions,  Notations  and  Tools 

Define  the  following  sample  spaces. 


nn‘ 


n  = 

n  n 


x.  =  +1 


Sn^Xl* 


for  i  =  1,  •  •  • ,  n}  , 

*’Xn>  =  8*  Rn(Xl*  *  *  * ’  Xn)  =  j 


where  x. ,  •  •  ♦,  x  c  .O }  , 
l  n  n 

Ag  =  {  s:  Sn(xlt  ...,xn)  =  s  where  x^  •  • . ,  x^e  fij  , 
n 

and  Or  =  {j:  Rn(xlt  •••.xj  =j  where  x^—.x^  . 

n 

Similarly,  define  sample  spaces  for  S*  ,  R*  ,  and  Rr  . 

o(fl  )  =  2n  and  for  Xj,  •  •  • ,  xne  fln  ,  P^,  •  •  • ,  *n  |  P)  = 

pk(l-p)n"k  if  k  of  the  x.'s  are  1  and  n-k  of  the  x^s  are  -1  . 

There  are  {£)  sequences  x^,  •***xn  with  k  l's  and  n-k  -l's  in 

Q  .  k  =  0,  •  •  • ,  n  . 
n 

We  have  seen  that  s,  s+e  [0,  2]  and  s'e  [-2,  0],  but  unfortu¬ 
nately  it  appears  that  no  simple  scheme  exists  for  listing  the  values 

for  arbitrary  n  nor  does  a  formula  exist  for  °(Og  ),  o(  Og+)  °* 

n  n 

o(fl  -)  .  Each  of  the  latter  quantities  is  an  increasing  function  of  n 

for  n<20  and  o (fl  )  <o(Og+)  =  ©(A-g-)  •  Even  when  the  values 

n  n  n 

attained  by  the  statistics  are  known,  no  expression  exists  for  their 
probabilities  for  given  n  .  Explicit  expressions  for  the  most  extreme 
values  and  then  probabilities  have  been  derived,  but  are  of  limited  use 
unless  the  size  of  the  test  is  very  small.  Attempts  for  finding 
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recursive  formulas  to  list  more  of  the  larger  values  of  the  test 
statistics  and  their  probabilities  for  arbitrary  n  have  been  futile. 
The  difficulty  arises  in  being  unable  to  obtain  an  ordered  list  of  the 
values  of  the  statistic;  as  n  increases  the  manner  in  which  the 
values  are  attained  also  increases  and  hence  an  ordering  of  the 
values  as  functions  in  n  is  impossible.  The  procedure  presented 
in  the  next  section  will  more  clearly  illustrate  the  nature  of  this 
difficulty. 

For  each  je{0,  1,  •  •  •,  n)  there  is  at  least  one  sequence 

x.,  •  •  •,  x  e  f\  such  that  J  (x.,  •  •  •,  x  )  =  j  ;  thus  /"l  = 

Inn  n  1  n  R 

n 

{o,  1,  •••,n}.  The  same  statement  is  true  for  O  +  and  (~1  -  . 

R  R 

n  n 

Note  that  f\ _  X  O.  -  f  -O.  _  _  .  In  general,  for  those 

b  K  9  I  A 

n  n  n  n 

sequences  x, ,  •  •  • ,  x  with  S  (x,,  •  •  • ,  x  )  =  s  ,  R  (x, ,  •  •  • ,  x  )  can 
l  n  ni  n  nX  n 

assume  only  a  few  values  of  j  .  Similar  statements  hold  for  S*  , 

R+  ,  S'  ,  R*  . 
n  n  n 

The  following  notation  will  be  needed.  Define  c^(s,  j)  to  be 

the  number  of  sequences  x^,  k  1 1  s  and  n-k  -l*s  that 

have  S  (x. ,  •  •  • ,  x  )  =  a  and  R  (x, ,  •  •  • ,  x  )  =  j  .  (If  x, ,  •  •  • ,  x  has 
nXn  nln  In 

Jn*Xl'  ***’Xn)  =  h*  *“’  then  Xr’‘,,Xn  contributeB  f  to 
i  =  Thus 


n 


2.1.1  P{(S  ,  R  J  =  (s,  j)  I  p}  =S  c,(s,  j)  pk(l-p)n_k 


n  n 


k=0 
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Define  c^(fl)  =  23  c^(»,  j)  so  that 

j=0 

n  .  , 

2.  1.2  P(S  =  a  Ip)  =  E  c,  (s)  p  (l-p)n‘ 

n  '  k=0  k 


and  C,  ( s  )  =  £  c,  (s)  so  that  s  is  the  smallest  value 

k  d  k  cx  n 

’  s>s_  ' 

-  °S  p 

of  s  satisfying 


2.  1.3  o<  >  E  Ck(8<x  p)  p  (l-p)n'  . 

Define  d  (j)  =  £  c  (s,j)  so  that 

•<ns 

n 

n  k  k 

2.1.4  P(R  *j|p)  =  23  dk(j)  p  (l-p)n‘  . 

*'  k=0 

Similarly,  define  c*  ( s+.  j)  ,  ck(s+),  )  ,  d*  (j)  ,  c‘(s‘,j), 

V*'*’  Ck(Sp)  and  d'k(j)  * 

The  procedure  for  obtaining  the  joint  distributions  involves 

•f'  ^ 

counting  the  ck(8,j)'  s  and  c^  {s  ,  j)'s.  We  establish  some  results 
useful  for  that  purpose.  The  first  three  results  deal  with  the  random 
variables  associated  with  the  one-sided  test  vs.  H*  . 

Lemma  1 .  For  each  n  >  1  and  x. ,  •  •  • ,  x  €  f\  ,  S+  (x, ,  •  •  • ,  x  )  = 
. -  —  t  n  n  n  i  n 

S+  (  -x  ,  •  •  • ,  -x. )  ;  if  S+  (x x  )  occurs  at  r  =  j  then  S+  (  -x  , 
nn  l  nln  nn 


•  ••,  -x^)  occurs  at  r  =  n-j 
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Proof.  Note  that  —  2  x.  - 

-  r  L=1  1 


1 

n-r 

n 

Z  x  =. 
i=r+l 

1  r+1 

—  2  ( 

n-r 

i=n 

1'” 

•  .  X  )  =  sup 
n 

{-Ex. 
r  i=l  1 

r 

r+1 

(-*.)  -  — 

1 

2 

2  (-x  )  } 

i=n 

L  X 

i=r 

n-r  , ,  i  n-r  i  r  i 

i  =  r+l  r  i=n  i=r 

S+  ( -x  ,  •  •  • ,  -x , )  .  Also  note  that  if  S+  (x.,  •  •  • ,  x  )  occurs  at 
n  n  1  n  i  n 

r  =  j  then  S+  { -x  ,  •  •  • ,  -x, )  occurs  at  r  =  n-j  .  _ 

J  n  n  1  B 

Lemma  1  provides  the  basis  for  a  number  of  useful  results. 

Theorem  2,  For  each  neZ+  , 

...  +  .  +  +  .  +  ..  +  ,  +.  +  .  +. 

(i)  ck  (s  ,  j)  =  cn_k  (s  ,  n-j)  ,  ck  (s  )  =  cn  k(s  )  and 

=  v.-«as+  .  jenR+  . 

r  r  n  n 

pc  [0,  l]  and  k«  {0,  1,  •  •  n  }  ; 

(ii)  P(S+  =  s+  Jp)  =  P(S*  =  s+  |  1-p)  Vs+€-Os+ 

n 

and  pc  [  0,  l]  ; 

( iii)  =  dn-k  =  dk  =  dn-k  V'’’  ke  t°»  l>  ’ '  * »  n) 

.„d  iiv)  P(R;  =  j  |P)  *  p(R;  =  n.j|up)  vjerv  an<J  pcl0>11 


Proof,  (i)  Refer  to  the  proof  of  Lemma  1  and  note  that  if  x^, 


-X1  ha8  n'k  1,8  * 


has  k  l's  then  -x 


28 


(ii)  P(S+  =  s+|p)  =  S  c+  (s+)  pk(l-p)n'k  = 

k=0 

^  cn  k  (S+;  (1-P)n"k  ^  =  P(Sn  =  S+  I1"?)' 

•  IX  “J\  IX 

k=0 

(iii)  Note  that  d*  (j)  =  2  c^(s+,  j)  and  c^(s+,  j)  = 

s+ens+ 

n 

+  .  +  .  . 

Cn-k  8  'n-j)  ' 

n  v  , 

(iv)  P(R^=j|p)  =  2  dk(j)p(l-p)n 

k=0 


n 


+  k  ,  n-k  _  .+  k  n-k 

=  £  d  (n-j)  p  ( 1  -p)  =  2  d  (n-j)  (1-p)  p 

k=0  n'*  k=0 


=  P(R*  =  n-j  j  1-p)  .  g 

+  + 

Note  that  (i)  implies  that  we  need  only  find  the  (s  ,  j)’s  for 
k<^j  or  for  to  evaluate  the  joint  distribution.  The 

numerical  computation  of  critical  regions  need  only  be  carried  out 
for  pe(0,i  by  (ii).  The  distribution  of  R+  is  symmetrical  in  j 

w  J  XX 

for  p  =  i  by  (iv). 

The  next  theorem  provides  upper  and  lower  bounds  for  the 

,  +  a  as  a  function  of  the  number  of  l's  in  the  sequence, 

values  a  e  i  2_+ 


Theorem  3.  For  any  n>2  and  ke  {l,  •  •  •,  n-l},  max  S  (X,,»»»,X  ) 
1  n  l  n 

=  2  and  min  S+  (X .,•••,  X  )  =  I  —  ^  I  where  X ,,•••,  X  is  any 
n  1  n  I  n  I  1  n  7 

sequence  with  k  l's  ;  furthermore  there  exist  sequences  of  k  l's 


which  attain  this  maximum  and  minimum 


1 


Proof.  For  kf  {l,  consider  the  sequence  x  ,  •••.x  with 

x.  =1  for  i  =  1,  • • • ,  k  and  x.  =  -1  for  i  =  k+1,  •  •  • ,  n  ;  S+  (x, ,  •  •  • 
i  l  n  1 

x  )  =  2  .  For  ke  {l,  •  •  •,  n-l}  consider  the  sequence  x.,  •  •  x  wil 
n  In 

x^  =  -1  for  i  =  !,•••,  n-k  and  x^  =  1  for  i  =  n-k+1,  •  •  • ,  n  .  If 


k<  [yl  then  R+  (x_,  •  •  •,  x  )  =0  and  S+  (x,,  •  •  x  )  =  n  2k.  ; 
L2  J  n  l  n  n  1  n  n 

k  >  [  then  R+  (x  ,  •  •  • ,  x  )  =  n  and  S+  (x  ,  •  •  • ,  x  )  =  n  ; 
[  2  J  n  l  n  n  l  n  n 


2k-n 


if  k  :  ?  then  J+  (x,,  *  •  •,  x  )  =  0  or  n  and  S+  (x,,  •  •  •,  x  )  =0 
2  n  1  n  n  1’  n 


Theorem  4.  Lemma  1  is  true  with  S  replacing  S  .  Theorem  2 

is  true  when  the  '+'  superscripts  are  replaced  by  '  superscripts. 

Theorem  3  is  true  with  min  S  (X,,  •  •  ♦,  X  )  =-2  and  max  S  (X.,***, 

n  1  n  n  l 

X  )  =  -  . 

n  n 

i  1  ^  1  ^ 

Proof.  Note  that  S  (x,,  •  •  • ,  x  )  =  sup  {—  £  x. - £  x.  } 

n  X  n  w  r  .  .  i  n  ■■  r  .  «  i 

r  i=l  l=r+l 

i  r  i  n 

=  -  inf  {-  £  (-x  )  -  -  £  (-x  )>  =  S  (-x  ,  -x  ) 

r  r  LI  1  n'r  i=r+l  1  n  1 


vx  ,  •  •  •,  x  e  Cl  • 

i  n  n 


Note  that  c^  ( s+,  j)  =  Cn_k  ( -s+,  j)  ,  ( s+)  =  cn.jc(  _s+)  » 

<  <*+.  p1  *  Cn-k  <-8+,  p»  “d  PK  =  s+  I  P>  =  P<Sn  -  "+  I 
Vs+€  Hg+  .  je.OR+.  kc  { 0 L ,  •  •  • ,  n}  pe  [0,  l]  and  neZ+  . 


Due  to  the  perfect  symmetry  between  the  tests  vs.  and 

H  vs.  H,  and  the  distributions  related  to  these  tests,  any  statement 
0  1 


1 
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made  about  the  former  can  be  correspondingly  made  about  the  latter. 
Hence,  we  will  discuss  only  the  former  in  what  follows. 

Results  similar  to  Lemma  1  and  Theorem  2  are  presented  next 
for  the  random  variables  associated  with  the  two-sided  test. 


Lemma  5.  F or  each  n  >  1  and  x, ,  •  •  • ,  x  t  i~l  ,  S  (x,»  » » • ,  x  )  = 
-  —  —  1  n  n  n  l  n 

S_(  -xi»  *•*»  -XJ  =  s(x>  •*•.*,)  =  S  (-x  ,  -x,);  if  S  (x  ,  •  •  • ,  x  ) 

n  l  nnn  inn  i  ni  n 

occurs  at  r  =  j  then  S  ( -x. ,  •  •  • ,  -x  )  occurs  at  r  =  j  ,  and 

n  i  n 

S  (x  ,  •  •  • ,  x  )  and  S  ( -x  ,  •  •  • ,  -x, )  occur  at  r  =  n-j  . 
n  n  1  n  n  1 


Proof. 


1_ 

r 

1_ 

r 


r 

1  n 

1  r 

-  2  (  -x. )  - 
r  i  =  l  1 

1 

n-r 

n 

S 

i=l 

X. 

1 

- £  x 

n-r  •  , ,  1  , 

i-r+1 

= 

2  ( -x  ) 

i=r+l 

r 

S 

1  n 

— —  s  v 

1  r+1 

£  x  - 

1 

1 

£  x  1  V  r 

i=l 

1 

n-r  ,.  l 

i=r+l 

n-r  i 

i=n 

r 

u  A. .  1  V  1 

1 

i=r 

Theorem  6.  Fpr  each  neZ  , 


(i)  ck(s,  j)  =  ck(s,  n-j)  =  cn_k(s,  j)  =  cn_k(s,  n-j),  ck(s)  =  cn_k(s) 


R  ’ 
n 


and  S^.p*  =Cn-k(8«,p)  =Ck(klV  VSenSn’  jeni 
pe  [0,  1]  and  ke{0,  1,  •  •  n}; 

(ii)  P(Sn  =  sjp)  =  P(Sn  =  sj  l-p)VscAs  and  pc  [  0,1]; 

n 

(iii)  dk(j)  =  dn_k(j)  =  dn_k(n-j)  =  dk(n-j)  Vj,  ke  {0,  1,  •  •  • ,  n}  ; 

and  (iv)  P(R  =  j  I  p)  =  P(R  =  n-j  Ip)  =  P(R  =  j  I  1  -p)  =  P(R  = 

n  i  n  i  n  i  n 

n-j  |  l-p)  V  je  Qr  and  pe[0,  1]  . 
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The  proof  of  Theorem  6  follows  directly  from  Lemma  5  and  the 
definitions. 


By  (i)  all  of  the  c^s,  j)'s  can  be  found  by  evaluating  those  for 
which  k<|^  —  j  and  j<  —  J.  By  (ii)  numerical  computation  of  critical 
regions  need  be  carried  out  only  for  pe(0,^]  .  The  distribution  of 


R  is  symmetrical  in  j  and  p  by  (iv). 
n 


The  upper  and  lower  bounds  for  values  in 


are  as  follows. 


Theorem  7.  For  n>2  and  k  e  { 1,  •  •  • ,  n-l),  max  S  (X  ,  •  •  • , X  )  =  2 
-  n  1  n 

where  X  ,  •  •  • ,  X  is  any  sequence  with  k  -1 1  s;  for  each  k  there 
1  n 

exists  a  sequence  which  attains  this  maximum. 

For  ke  {l,  .  •  • ,  n-l}  let  L  (k)  and  LJk)  denote  min  S  (X,  ,••• 
X^)  when  n  is  even  or  odd,  respectively,  and  X^,  •••,X  is  any 
sequence  with  k  -l's.  We  have 

L  (k)  =  L  (n-k)  =  for  k  =  1,  •  •  • ,  |  , 

e  e  n  L  4  J 


LJk,  =  L^n-k)  fork.  |  , 

LJkl  .  LJn-k)  .  for  +  . 

Lo(k)  .  L0(o-k)  =  k.l,...,  [2^]  . 

Lo(k)  =  Lo(n-k)  for  k  .  ii  . 

and  L0(k)  =  Lo(n-k)  =  for  k,  . 


For  each  k  the  lower  bound  is  attainable 
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Proof.  We  establish  the  lower  bounds  for  n  even  only;  the  method 

of  proof  for  n  odd  Is  similar. 

Li  (k)  =  L  (n-k)  for  ke  { 0,  1,  •  •  • ,  n}  by  Lemma  5, 
e  e 

Note  that  if  k<  I  ~  1  and  x,  =  -1  then  S  (  -1,  x_ ,  •  •  • ,  x  )  >  1  . 

—  L<-  J  1  n  4  n  — 

We  are  interested  in  finding  lower  bounds  so  in  what  follows  assume 
X!  =  1  * 


fn-1  1 

|  1 

n 

n-2k 

-3—  ; 

- 

2  x.  I 

=  -  so 

L  4  J 

1  n 

i=i  l! 

n 

Suppose  k  =  —  .  If  x,  =  1  then  =-  £  x. 


n-2  -2k 


2k  n 

n-2  -  2(n-2) 


2  .  i  n-2 

i  =  l 


2  Xi  = 
i=3 


If  x^  =  1  then  £  x^ 


£  x. 


n-2  -2(k-l) 


n-2k  _  n 

n-2  2(n-2) 


Hence  L  (£)  >  yy 
e  4  —  2(n-2) 


|i 

1 

n 

f  +  1<  k<  j  ; 

£  x.  -  ~ 

£  x.  I 

L  4  J  -  -  2 

1  1 

1*1  1  n-X 

i=2  1  ' 

=  1  - 


n-1  -2k 


— r-  .  Hence  L  (k)  >  — -  . 

n-1  e  —  n-1 


We  show  that  for  k< 


[¥] 


that  L  (k)  is  attainable.  If 
e 


k  is  even  consider  the  sequence  with  x.  =  1  for  i  =  1,  •••, 

1  u 


x.  =  -1  for  l  =  +  1, 

1  fa 


n+k  ,  f  •  n+k  .  , 

■ ,  and  x.  =  1  for  i  =  +  1,  •  •  • ,  n. 

Ci  l  fa 


For  this  sequence  S  may  occur  only  for  r  =  0,  ,  n  by 

71  b  fa 

Theorem  15.  To  verify  that  for  this  sequence  occurs  at  r  =  0 


is  straightforward.  If  k  is  odd  construct  a  sequence  with  =  1  for 
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n-k-1  .  n-k-1  n+k-1 

1  =  1,  •  •  • ,  — = —  ,  x.  =  -1  for  i  =  — • —  +  1,  •  •  • ,  — - — 

4  i  4  2 

,  if  n+k+1 

and  x.  =  1  for  i  =  — = - ,  •  •  • ,  n  . 

1  M 

Lastly,  we  show  that  for  k=^,  Lg(k)  is 

attainable.  Construct  the  following  sequence:  x  =  x  =  1,  x  = 

1  n  2 

Xn-1  =  -1  '  Xi  =  1  f°r  1  =  -[t1]  and  1  =1  +  [Ti]+  1  ' 

••  •,  n-2  ,  and  if  k  is  odd  x.  =  1  for  i  =  S  and  x  =  1  for  all 

1  i  l 

remaining  i  or  if  k  is  even  x.  =  -1  for  i  =  j  J  +  1,  •••, 

^  +  [— -T~J  •  ^  *or  this  sequence  may  occur  only  for  r  =  0,  n,  1, 

,  ,  ,  n  fk-ll  n  rk-n 

n-l.Z.n-Z,^  -  ^ — J  *  2  +  L-! — J  “  k  is  even  and  a.vso  for 


r  =  2  -  j.,  2  if  k  is  odd  .  | 

To  obtain  C\  _  and  n  *  our  intention  is  to  construct  A 
5  S  r 

n  n 

as  defined  in  the  next  theorem. 


% 
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n 


A  =  {  —  S  x.  -  — 
n  r  .  .  i  n-r 
i=l 


fL+  =  A 

S  n 

n 

n 

for  each  n>l  where 

E  x.  : 

i=r+l  1 

x  ,  •  x  e  CL  and 

I  n  n 

if 

1  r 
then  -  E 

n 

x.  -  —  £  X.  >  | 

n- 2k  1 

r  1=1 

i  n-r  , ,  i 

i=r+l 

n  i 

k  =  0,  •  •  • ,  n  and  r  a  0,  •  •  • ,  ]  }  . 

Proof.  For  any  sequence  x^,  ••  •,  x^c  A^ 

I  1  r  1  n 

S  (x,»  =  sup  -  S  X;  -  r-T  2  xi 

n  1  n  r  .  ,  i  n-r  i 

r  1  i=l  i=r+l 


=  max 


r,  r  i  n  1 

{sup  i.  E  x  -  A-  E  X 
r  *-  i=l  i=r+l  J 

T  1  n  1 

sup  I  E  (-x.)--i-  £  (-x) 

r  Lr  i-1  1  n'r  i=r+l  lJ 


=  max  (S(x*  •••,*  ),  S^(-x  ,  -x  )  }  . 
n  i  n  n  i  n 

Thus  A_  C  Ac+  . 

o  o 
n  n 

If  S+  (x, ,  •  •  • ,  x  )  occurs  at  some  j  >  ^  then  S+  ( -x  ,  •  •  • ,  -x, ) 
n  1*  '  n'  J  12  J  n  n’  1 

occurs  at  n-j<[£|  and  S+{  -x  ,  •  •  • ,  -x  )  e  A„  ,  but  S+(x  ,  •••,x)  = 
— Lc  J  nn  in  ni  n 

S*(-x  •  ■  • ,  -x  )  .  Hence  Ci^+C  A  . 

n  n  l  on 

n 

Next,  we  show  that  A  Cl  f\e+  .  We  must  show  that  for  each 

n  o 

n 

1  r  1  n  n  2k  I 

value  s  =  —  £  x.  -  - -  E  x,  >  — -  where  x,,  •  •  •,  x  has 

Or.,  i  n-r  ,  i  I  n  I  1*  ’  n 

i=l  i=r+l 
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k  l's  that  there  exists  a  sequence  with  k  l's  satisfying  =  s^  . 

If  x,,***,x  has  a  l's  in  the  first  r  places  and  n-k  l's  in  the 
In 

last  n-r  places,  consider  the  sequence  y  ,  •••,  y^  with  y^  =  -1 

for  i  =  1,*m,  r-a,  y^  =  1  for  i  =  r-a+1,  •  •  •,  r  ,  y^  =  -1  for  i  = 

r+l,  •  •  • ,  n-k  and  y.  =  1  for  i  =  n-k+1,  •  •  • ,  n  .  S+  (y,,  •  •  • ,  y  )  =  srt  . 

x  n  1  n  o 


The  set  inclusion  fl g  CX"lg+  is  proper  for  n  =  2,  •  •  • ,  20.  If 

n  n 

s+f  H  AH,,  and  if  s+  occurs  for  a  sequence  with  k  l's  then 

b  O 

n  n 

s+  <  s  for  all  a  which  occur  for  a  sequence  of  k  l's;  s+e  ,0g-l\ 


.O.c  does  not  imply  s  h<  s  Vse  fL  . 
b  b 

n  n 

One  might  suspect  that  for  8€.f~lg  .  ^  c^fs)  -  (s)  •  This 

n 

relationship  holds  for  some  k  and  s,  however,  the  set  of  such  values 
is  not  large  enough  to  be  useful. 


2.  2  The  Procedure 

The  procedure  used  to  find  the  joint  and  marginal  distributions 
is  now  presented. 

Notation.  We  need  to  be  able  to  describe  types  of  sequences  x  ,  •  •  •  x  . 
■  — ■■  ■  ■  ■  *  In 

By  the  expression  T^  (r,  a)  denote  the  set  of  all  those  sequences  of 
length  n  with  k  l's  where  a  l's  occur  in  the  first  r  places  and  k-a 
l's  occur  in  the  last  n-r  places. 
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For  example  T  (2,  1)  *  (1-111-1,  1-11-11,  1-1-111,  -1111-1, 

■111-11,  -11-111}  . 


The  Procedure 


1 


List  the  values  —  S  x. 

r  i=l  1 


1 


n 

£ 


x.  e  A  for  each  r 
n-r  J ,  i  n 
i=r+l 


systematically  by  varying  the  number  of  -1 ' s  in  the  sequence, 
each  value  include  the  types  of  sequences  yielding  that  value. 
For  r  =  0  we  have  the  following  list. 


1 


n 


0  -  -  £  x 

n  LI  1 


Types  of  Sequences 


1 

n-2 

n 

n-4 

n 


T°  (0,  0) 

T*  (0,  0) 

11 

T l  (0,  0) 


For 


{ 


0 

1_ 

n 


n 

2 

^n  ^  n  *8  even 

n-1 

Tj2  (0,0)  if  n  is  odd 


j 
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r  =  2  we  have  the  following  lists. 


1  2  1  n 

-  S  x —  Z  x 

2U1  1  n'2  i=3  1 

Types  of  Sequences 

1  -<-ir!)  =  2 

T2  (2,  2) 

n-4  2(n-3) 

1  n-2  '  n-2 

T*  (2,2) 

n-6  2(n-4) 

'  n-2  '  n-2 

• 

T*  (2,  2) 

• 

• 

n+2 

1 

Tn2  (2,2) 

if  n 

is  even 

n+1 

i  .  ( — —  )  =  Eli 

1  n-Z1  n-2 

Tn2  (2,2) 

if  n 

is  odd 

n+4 

2  _  n-4 

1  "  n-2  =  n-2 

Tfl2  (2,2) 

if  n 

is  even 

n+3 

1  n-3 

’  n-2  "  n-2 

Tfl2  (2,2) 

if  n 

is  odd 
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that  value.  The  information  for  r  as  found  from  I.  by 

inspection  must  be  included.  Do  the  same  for  #-l's  =  l,2,  • .  • ,  T 

Example  1.  The  procedure  for  n  =  5  is  demonstrated. 


+ 

s  r 


#  - 1  ’  s  =  0 


Types  of  Sequences 


V5' 5) 


2 

3 


#  -l's  =  1 


r  Types  of  Sequences 


4  5 


4 

2 


T5(5,  4) 
T5  (4»  4) 

T l  (2.2) 

T5  <3,  3) 


s+  r  Types  of  Sequences 


j  5 


#  -l's  =  2 


(5.  3) 


1  1 


1  « 
1  * 


T5(l.l) 


T5(4.  3) 


Tg  (2,2) 


2  3 


J  3 


T 5  (3,  3) 


Tj  (3,2) 


Note  that  for  #  -l's  =  j j+  1,  •  •  • ,  n  similar  listings  can  be 
found  from  those  for  #  -l's  =0, 


PJ|3 
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III.  Forgiven  n  the  c^(s+,  j)  *  s  and  c  (s,  j)'s  are  now  counted; 

K  K 

one  need  only  count  them  for  #  -l's  =  0,  1,  •  *  j  by  Theorems 
2  and  6  . 

Order  the  values  in  II.  within  each  category  of  #  -l's  from 
the  largest  to  the  smallest;  within  each  category  count  the  number  of 
sequences  yielding  each  value.  Include  the  point  j  at  which  the  value 
is  attained  and  the  types  of  sequences  yielding  that  value. 


+  ,  + 


Example  2.  For  n  -  5  the  c  ( s  ,  j)1  s  are  found  for  #-1*8=0,  1,2. 


#  -l's  =  0 

+  +  +  Types  of 

8  j  c  (s  *  j)  Sequences 


*}<5.  5> 


2 

1 

2 

3 

3 

5 


#  -l's  =  1 

+  +  Types  of 

c^s  ,j)  Sequences 


4 

3 

2 

5 


1 

1 

1 

2 


T5<4.  4) 
TgO,  3) 
Tj(2,2) 
T*(5,  4) 


#  -1*2  =  2 

;  ,  Types  of 

s  j  c  (s  ,  j)  Sequences 

2  3  1  T^(  3,  3) 

f  4  3  T^<4,  3) 

|  2  1  T*(2,2) 

1  1  2  T^(  1,  1) 

j  3  1  T^(  3,  2) 

~  S  2  T^( 5,  3) 
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In  particular,  consider  the  category  with  #  -1 1  s  —  1;  there  are 
5  + 

(j)  such  sequences.  The  largest  value  is  s  =  2  which  occurs  in 

2 

exactly  oneway,  namely  1111-1.  c^(2,  4)  =  1.  The  second  largest 

value  is  s  +  =  1.  There  are  two  sequences,  namely  111-11  and 
1  3  1  5 

1111-1,  such  that  —  Ex.  -  —  Ex.  =1.  Of  these  only  111-11 
3  i=l  1  2  =  1 


i=4 


satisfies  S*  =  1.  The  other  sequence  satisfies  S"^  =  2.  c+(l,  3)  =  1. 

5  5  4 

+  2 

The  next  largest  value  is  s  =  —  .  There  are  three  sequences,  namely 

]  2  j  5  2 

11-111,  111-11  and  1111-1,  such  that  j  E  x.  -  -  E  x.  =  -  .  Of 

2  .  1  3  .  „  i  3 

+  2  1-3 

these  only  11-111  satisfies  S  =  — .  The  other  two  sequences  have 

5  3 

\  .  c*(^-,  2)  =  1.  The  remaining  value  is  s+  =  J-  .  All  5 

j  i  4  3  5 


1  5  3 

sequences  satisfy  —  E  x.  =  —  .  Of  these  only  1-1111  and  -11111 

5  i-2  1  5 

satisfy  S*  =  .  The  other  sequences  have  .  c^(~  ,  5)  =  2  . 


5  5 

All  five  sequences  have  been  counted 


Example 

3.  For 

n  =  5  the  c^(  s, 

j)'s 

are 

found  for 

#  -l's  -  0, 

#  -r  s  = 

0 

#  -l’s  = 

J _ 

Types  of 

Types  of 

s  j 

c5(s,  j) 

Sequences 

s 

j 

c4(s>  j) 

Sequences 

1  0 

1 

2 

T*(0,  0)  j 

2 

4 

1 

T5  <4-  4) 

,1  5 

1 

_5  ,  1 

i  ^ 

4 

2 

Tg  (5,5)  | 

12 

1 

1 

t5<i,  0) 

< 

u 

2 

1 

Tg(2.  1) 

u 

3 

1 

T*(3,  3) 

• 

rl 

1 

4 

J 

3 

2 

2 

Tg(2,  2) 

j 

2 

1 

4 

1 

1  3 

3 

2 

T*(3,  2) 
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#  -l's  =  2 


Types  of 

j  c  (s, j)  Sequences 


(2,  0) 
T3  (3,  3) 
T3  (1,  0) 
Tg  (4,  3) 
(3,  1) 
(2,  2) 
Tg  (1.  U 
Tj  (4,  2) 


Consider  the  category  with  #  -l's  =  1.  The  largest  value  is  s  =  2 

which  occurs  in  exactly  2  ways,  namely  for  1111-1  and  -1111; 

c  (2,  4)  =  c_,(2,  1)  =  1  .  The  second  largest  value  is  s  =  1.  There 
4  4 

i  1  ^  j  ®  . 

are  four  sequences  satisfying  —  £  x. - S  x  =  1  or 

’  2  i  =  l  1  3  i=3  1  1 

1  3  1  5  I 

~  Z  x.  -  -  £  x.  =  1  ;  they  are  -11111,  1-1111,  111-11,  1111-1. 

3  i=l  1  i=4  1  1 

Of  these  only  1-1111  and  111-11  satisfy  S,.  =1.  The  other  sequences 
have  Sj.  =  2.  c  (1,  2)  =  c^(l,  3)  =  1.  The  third  largest  value  is  s  = 


1 

2 

1 

5 

2 

£  x. 

£  x. 

=  —  or 

2 

1 

"  3 

1 

3 

i  =  l  1  3  i=3  1  1  3  13  i=!  1 


~  £  x. 

2  .  .  i 

i=4 


Of  these  only  11-111  satisfies  ~  .  The 


UJ  (  M 
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other  sequences  have  >  —  .  J^(  11  -11 1)  =  2,  3 


Hence 


cj|,2)  =  c.(\  ,  3) 


‘4V  3 


4' 3 


£ 

2 


80  R  (11-111)  = 

.  All  five  sequences 


have  been  counted. 

The  joint  distributions  (S+  ,  R+  )  and  (S  ,  R  )  are  available 

n  n  n  n 

from  the  lists  constructed  in  III.  The  c*(s+,  j)*s  and  c^Js.jVs  are 

used  to  find  P((S+,  R+)  =  ( s+,  j)  I  p)  and  P((S+,  R+)  =  ( s+,  j)  I  p)  according 

n  n  1  n  n  1 

to  (2. 1.  1)  . 


IV.  The  marginal  di  stributions  S+  and  S  are  found  from  the  joint 

n  n 

distributions  in  III.  according  to  (2.1.2).  Since  these  distributions  will 
be  used  to  find  critical  regions  for  the  tests  vs.  h|  and  HQ  vs.  , 
the  s+  and  s  values  must  be  ordered  over  all  categories  of  #  -l’s  . 
The  terms  for  #  must  be  included. 

Example  4.  The  marginal  distributions  S*  and  are  found. 

P(Sg  =  s+  |  p) 

p(l-p)4  +  p4(l-p)  +  p2(l-p)3  +  p3(l-p)2 

2  3  3  2 

3p  (1-p)  +  3p  (1-p) 

2..  X3  X  3,i  \Z 

P  (1-P)  +  p  (1 -p) 

(1-p)5  +  p5  +  p(l-p)4  +  p4(l-p) 

2  3  3  2 

+  2p  (l-p)J  +  2pJ(  1  -p) 
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2  4  4 

-  P(  1  -p)  +  P  ( 1  -P) 

|  2p(l-p)4  +  2p4(l-p) 

1  2.,  3  ,  3..  ,2 

3  P  (1-P)  +  P  (!-P) 

1  ,  2n  ,3X,  3..  .2 

-  2  p  ( 1  -p)  +  2p  <l-p) 

s  P(S5  =  s  |  p) 

2  2p{  1  -p)4  +  2p4(  1  -p)  +  2p2(l-p)3  +  2p3(l-p)2 

~  5p2(;-p)3  +  5p3(l-p)2 

|  2p2(l-p)3  +  2p3(l-p)2 

1  (1-p)5  +  p5  +  2p(l-p)4  +  2p4(l-p) 

2  3  3  2 

+  p  ( 1  -p)  +  p  ( 1  -p) 

2  4  4 

-  p(  1  -P)  +  p  ( 1  -p) 


For  given  p  the  critical  regions  for  a  size-©<  test  may  now  be 
computed.  By  Theorems  2  and  6  the  critical  regions  need  only  be 
computed  for  p  <  i  . 

V,  Lastly,  the  marginal  distributions  R4  and  are  found  from 
the  joint  distributions  in  III.  according  to  (2.  1.4).  The  terms  for 
#  -l,s>[j]  must  be  included. 


. — —  i  hi  irlmra . MaaaailiiMi 
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2.  3  Properties  of 

Properties  of  S*  when  the  null  hypothesis  is  true  are  studied. 
Included  first  is  a  result  for  reducing  the  calculations  needed  to  evalu¬ 
ate  S+  for  a  sample  sequence.  An  analysis  of  o  (/"L  +)  follows.  The 
n  o 

n 

behavior  of  the  probability  density  and  its  mode,  mean,  variance  and 

median  are  investigated  for  varying  n  and  p  ;  critical  regions  for 

the  test  H  vs.  are  tabulated  and  P(S+>s+  I  p)  is  studied 

0  1  n  —  1 

as  a  function  of  p,  n  and  s+.  Indications  for  testing  when  n>20  are 
discussed. 

Theorem  10  is  useful  for  reducing  the  calculations  needed  to 
evaluate  S*  in  practice.  First  we  prove  Lemma  9. 

+  i  J  1  n 

Lemma  9.  If  S  (X  ,...,X  )  =  —  £  X.  - — r  £  X.  where 
xii  n  j  .  .  i  n  ^ 

j  i=i  j  i=j+i 

i  j-1  i  n 

j  4  0,  n,  then  -  £  X.  <  1  or  — -= — r-  £  X.  > -1  when  j>l 

'  j-1  .  .  i  n+l-j  .  .  i 

i=l  i=i 


and  t-V  £  X. 

J+1  1-1  1 


hen  t-L-  S  X.  <1  or  -  £  X.  > -1  when  j>l 

J-1  .  .  i  n+l-j  .  .  i 

i=l  J  i=j 

j+1  n 

£  X.  <  1  or  - ; — r  £  X.  >  -I  when  j<n-l.  If  j  =  1 

■  ,  i  n-l-j  .  i 

i=l  i=j+2 


1  n  1  n 

then  —  £  X.  >-l  and  if  j  =  n-1  then  —  £  X.  <  1 

n  .  ,  l  J  n  ,  l 

i=l  i=l 


+  i  j  i  n 

Proof.  Suppose  S  (X, ,  . . . ,  X  )  =  £  X. - -r  £  X.  and 

■  n  1  n  j  ,,i  n-j  .  . . ,  l 

J  i=l  J  i=j+l 


j>i  .  if  rV  2 
1  i=l 


j-1  !  n 

£  X.  =  1  and  — i-r  £  X.  =  -1  then 

.  ,  i  n+l-j  .  .  i 

i=l  i=j 


50 


_  1  •>  x  ,  n 

2  5?  -  y  I  ^  •) 

i  a+1  -j  2  Xi  •  *i»  implies  that  —  S  X 

i=j  j  i=l  1 

2  n 

n - i  .  2  ~  T"  (j~l-l)  -  — — -  f-(n-j))  s  2  -  —  <  ? 

J  i=j+l  1  J  n-j  1  '  •>''  *  j  <  2  ,  a  contra¬ 


diction. 


A  similar  srgumsa.  is  used  to  verify  the  remainiag  siafemenfs 
of  the  theorem. 

■ 

Theorem  1 0.  If  S+ (X  ,  y  i  .  JL  l  v  l  n 

“  1’  '  V  ~  J”  2  Xi  -  7TT  2  x-  and 

1=1  J  i=j+l  1 

j  /  0,  n  ,  then  =  1  aiid  X.  j  =  -1  . 


<■  ,  v  j  j”1 

V  ‘i*  ^  S  X  s  2  X  -  J 

j=l  Ul  1 


1  !  Y  1  n  ,  j-i  _ 

T“  £  X  —  — -  i, .  y  "v-  1  .  1  il 

1  i*»  *  "_i  uj+i  1  =  r  if,  i  ■ 11  -th  1  s. x,  +1 


j  /  0,  n  , 

then  X. 
J 

Proof. 

Suppose 

n 

n 

and  2 

X.  =  2 

i=j+l 

x  .  . 

X=j 

j  ,  n 

5  r  s  X.  -  -L  s  x  -  I  1 

J  isl  1  -i  Uj  i  f  in 

=  _i_  j‘v  1  " 

J'1  if,  ‘  •  ^rn-  .f.  xi 
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1  ,  1 


M 


1  ,  1 


4-  (  -r-V  2  X,  +  1 )  -  -4-  ( 

J  J-l  is:1  1  n-J  n-j+1 

1  j-1  j  » 

<  -rV  2  X. - ■  2  X. 

J-1  -  ,  i  n-j+1  .  .  i 

i=l  i=j 


n 

2  X.  +  1) 

.  .  i 

l=J 


1  i 

since  either  t-y  2  X.  <  1  or  - r-y  2  X.  >  -1  by  Lemma  9. 

J  i=l  1  J+1  i=j  1 

Thus  for  j  /  0,  1,  n,  if  X^  =  -1  then  R*(Xj,  X^)  yf  j,  a  contradic¬ 
tion.  Suppose  j  =  1  and  X^  =  -1;  then 


4  2  X 
J  i.l  1 


n 

2 


i=j  fl 


X.  =  -1  - 
i 


1 

n-1 


n 

2  X.  >  0 

:  i  _ 


l  n 

~ 7  2  X  =  -1  4.  X  =  X  = 

n_1  i=2  1  ^  2  3 


•••  =  X  =  -1  . 
n 


*f  -f 

But  X  ,  •  •  • ,  X  =  -1,  •  • . ,  -1  has  S  =1  and  R  =0,  a  contradiction, 
in  n  n 

Hence,  if  R+  (X  ,  •  •  • ,  X  )  =  j  where  j  4  0,  n  then  X.  =  1 . 

n  l  n  '  j 


A  similar  argument  shows  that  X^  =  "1  • 


Hence 

calculate  — 
r 


to  evaluate  for  a  sample  sequence,  one  need  only 
r  i  n 

2  *i  •  ~ ■  1  2  x.  for  those  re  {1,  •  •  • ,  n-1 }  where 

i=l  ~  i=r+l  1 


x  =  1  and  x  =  -1,  and  if  each  value  calculated  is  less  than  1  also 
r  r +i 

for  r  =  0  if  the  number  of  l’s  in  X2»'"»xn  >•  less  than  or  equal  to 
the  number  of  - 1  ’ s  or  for  r  =  n  otherwise. 
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No  formula  for  the  order  of  the  sample  space  S  has  been 


n 


es 


i=l  i=r+l 

with  n  .  Moreover  the  number  of  r  values  with  r< 


found,  but  it  is  of  interest  to  study  the  behavior  of  o(^g+)  as  a  func- 

n 

tion  of  n  .  o(Og+)  vs.  n  is  plotted  in  Figure  3  for  n  =  1-10,  15,20. 
n 

Note  that  o{f~lc+)  increases  rapidly  as  n  increases.  This  behavior 
n 

is  expected:  Consider  the  lists  constructed  in  I.  of  the  procedure  for 

counting.  For  given  r  and  for  each  a  the  number  of  types  of  se- 

_k.  ,  1  '  v  1  ^  v  ^  (n-2kl  . 

quences  T  (r,  a)  which  have  —  Z  X.  — ■ —  2  X  >  -  increas 

n  r  .  i  n-r  . _ _  i—  L  n  J 

— increas®s 

so  that  the  collection  of  lists  grows.  Also  as  r  increases  so  does 

the  number  of  types  of  sequences  with  a  l's  in  the  first  r  places. 

Many  of  the  values  obtained  using  the  procedure  for  given  n  are  unique. 

log2(o(Os+))  vs.  loggn  *8  plotted  in  Figure  4  ;  note  that  for 
n 

5-10.  15,20  the  graph  is  nearly  a  straight  line.  The  best  least  squares 

straight  line  fitting  these  points  is  log_(o(Oc+))  =  2.296  log  n  -  2.456, 

n  -2  456 

so  an  approximation  to  +)  may  be  found  by  o((~\  +)  =  2~  , 

1.2,6 


Next  we  investigate  the  behavior  of  the  probability  density  S+ 

n 

and  its  mode,  mean,  variance  and  median  for  varying  n  and  p  .  In 
Figures  5.1,  5.2  are  graphed  P(S+  =  s+  p)  vs.  s+  for  n  =  10  and 
20,  for  p  =  .  1,  .2,  .3,  .4,  .5  and  for  those  s  for  which 
P(S^  =  s+  |p)  >.01  .  The  probability  densities  for  p  =  .  6,  .7,  .8,  .9 
are  the  same  as  those  for  1  -  p  by  Theorem  2  and  hence  are  omitted. 


Figure  4.  log2  (o(flg+))  ve. 


Only  the  "large"  probability  masses  are  included  since  o(  +) 

n 

is  very  large. 

Note  that  for  each  n  plotted  as  p:  .1 - >-.5  the  number  of 

"large"  probability  masses  for  s*  near  2  decreases  and  for  s"*" 

4* 

near  0  increases  ;  the  s  values  with  "large"  probabilities  are 
shifting  downwards. 

Let  m+  =  P(S+  =  s+  Ip)  where  s  +  is  the  mode  of  S+ 

n,  p  n  m  i  m  n 

for  those  values  of  n  and  p  .  Note  that  sup  m+  =  m+  „  =  1 

p  n,  p  n,  0 

and  the  mode  occurs  at  s^  =  1  ;  the  only  sequence  with  positive 
probability  when  p  =  0  is  the  sequence  with  =  -lyi  .  When  p 

■f 

is  near  0  the  mode  occurs  at  s  =  1  and  p  must  be  nearer  to  0 
in  order  for  the  mode  to  occur  at  s+  =  1  as  n  increases.  Note  the 
similarity  in  the  shift  of  the  mode  as  p:  0 — >  ~  for  each  n.  As  p 

w 

increases  from  0,  m+  decreases  and  the  mode  shifts  to  values 

n,  p 

s  <  1  ;  then  as  p  increases  to  =- ,  m+  increases  and  the  mode 
m  2  n,  p 

shifts  to  values  s+  >  1  . 

m 

For  n  =  5-10,  15,  -0,  E(S*  |  P)  vs.  P  is  sketched  in  Figure  6. 

The  mean  is  symmetric  in  p  and  1-p  because  of  the  symmetry  in 

S+  ;  E(S+  |  p=Q)  =  1  for  all  n  . 
n  n  1 

For  each  n  plotted  E(S^  |  p)  decreases  as  p  increases  from 
0  and  then  increases  as  p  nears  —  .  A  similar  behavior  was  noted 


for  the  mode.  The  minimums  tend  farther  from  0  and  1  as  n 


increases  and  a  local  maximum  occurs  at  p  =  —  for  all  n  .  In 
general  as  n  increases  E(S*  |  p)  decreases  for  each  p  . 

Note  that  the  range  of  E{S+  p)  over  n  and  p  is  rather 
small. 

The  mean  tends  to  be  flatter  for  n  even  than  for  n  odd  . 

Var  (S*  j  p)  vs.  p  is  plotted  in  Figure  7  for  n  =  5-10,  15, 

20.  Again  symmetries  in  p  and  1-p  occur  and  Var(S+  I  p=0)  =  0 

n  l 

for  all  n  .  For  each  p  ,  Var(S*  |  p)  decreases  as  n  increases. 
For  n  =  15,  20  the  variance  is  nearly  constant  for  .2<p<.8. 

Figure  8  illustrates  the  behavior  of  med(S*  |  p)  for  n  =  5,  10, 
15,20  as  p  varies.  medfS^  |  p)  =  med(Sn  j  1-p)  for  all  n  and 
p  and  med(Sn  |  p=0)  =  1  for  all  n  .  is  a  discrete  random  vari¬ 
able  so  the  median  is  a  discontinuous  function  of  p  .  We  expect  that 

■f  I 

as  p  changes  to  p+6  where  6  is  small  that  medfS^  j  p)  and 

med(S*  j  p+6)  will  occur  at  values  of  s+  near  each  other,  however, 

since  P(S+  >  s+  Ip)  is  of  the  form  2  C,  (s+)  p^(l-p)n  ^  . 
n  k=0 

The  median  as  a  function  of  p  behaves  as  does  the  mode  and 

mean:  the  median  decreases  as  p  increases  from  0  and  then  in¬ 
creases  as  p  nears  .  The  following  theorem  and  conjecture 
suggest  the  median  has  a  local  maximum  at  p  =  y  for  all  n  . 

w 


Theorem  11.  For  all  n,  med(S+  p==-)  >1 
-  n  2 


Proof.  We  show  that  the  number  of  sequences  with  S^>1  is  at 


least  2 


n-1 


Note  the  following. 


(i)  The  sequence  with  n  l's  and  the  sequence  with  n  -l's  each 


have  S  =  1  . 
n 


(ii)  Any  sequence  with  k  -l's  where  k<|^  and  =  -1  has 


S»  >  n-l-2(k-l)  +1> 

n  —  n-1  — 


(iii)  Any  sequence  with  k  -l's  where  k>^  +  1  and  =  1 


,  0+  ^  ,  n-1  -2k  ^  , 

has  S  >  1  -  - - —  >  1. 

n  —  n-1  — 


(iv)  For  1  < 


the  sequence  with  n-2k  l's  in  the  first 


n-2k  places,  k  -l's  in  the  next  2k -1  places  and  a  1  in 

the  last  place  each  have  S  >  1  -  — —  =  1  . 

n  — 


(v)  For  1  <  k< 


2k 


the  sequences  with  a  -1  in  the  first 


place,  k  l's  in  the  next  2k-l  places  and  n-2k  -l's  in 

^  lc  _k 

the  last  n-2k  places  each  have  S  — —  -(-1)=  1  . 

n  2k 

(vi)  For  k  =  ^  the  sequence  with  a  1  in  the  first  place, 

j  -l's  in  the  next  n-2  places  and  a  1  in  the  last  place 

each  have  S+  >  1  . 

n  ~ 

All  of  the  sequences  described  above  are  unique;  the  total  number  of 
these  sequences  is  Q  where 


) 

J 
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If  n  is  odd  we  have 


r.  1 

2 

la- 2  \ 

2  +  2  z  rv'1)  >  2 

n-1  ; 

|  =  2 

k=l  * 

^  ~2~  1 

y 2  / 

Note  that  all  sequences  with  S+  >  1  have  not  been  included 
in  the  counting  for  Q  . 

Conjecture.  For  all  n  and  p,  med(S*  |  p)  <  1. 

The  proof  of  the  conjecture  requires  one  to  first  count  all  those 
sequences  with  S*  >  1  for  each  n  .  Such  a  task  appears  to  be 
impossible. 

A 8  a  consequence  of  Theorem  1 1,  note  that  E  (S  +  I  p=4  >  4 
for  all  n  . 

Critical  regions  for  the  test  vs.  H*  are  contained  in  Table  9 
for  n  =  5,  10,  15,  20  and  p  =  .  1,  .  2,  .  3,  .  4,  .  5.  Critical  regions  for 
p  i  .  6,  .  7,  .  8,  .  9  are  also  available  from  the  table.  See  Theorem  2. 
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Table  9.  P(S  +  >  s+  I  p). 
n  — 


n=  5 


.  1 

.2 

.  3 

.  4 

.  5 

2/1 

.074 

.  109 

.  122 

.  125 

.125 

3/2 

.098 

.186 

.254 

.298 

.  313 

4/3 

.  106 

.211 

.298 

.  355 

.  375 

n=6 

S  \ 

.  1 

.  2 

.  3 

.4 

.  5 

2/1 

.066 

.087 

.087 

.081 

.078 

8/5 

.093 

.157 

.  189 

.201 

.  20  3 

4/3 

.  103 

.  195 

.261 

.  300 

.  313 

6/5  •• 

.  109 

.228 

.  335 

.411 

.438 

n=7 

.X 

.1 

.2 

.  3 

.4 

.  5 

2/1 

.060 

.070 

.061 

.051 

.047 

5/3 

.089 

.136 

.154 

.132 

.125 

3/2 

.094 

.  161 

.199 

.215 

.219 

6/5 

.  108 

.223 

.  326 

.  397 

.422 

n=8 

X 

.  1 

.2 

.  3 

.4 

.5 

2/1 

.054 

.056 

.043 

.032 

.027 

12/7 

.086 

.119 

.  109 

.085 

.  074 

5/3 

.089 

.133 

.136 

.121 

.113 

3/2 

.090 

.143 

.  160 

.159 

.156 

10/7 

.099 

.182 

.  235 

.259 

.266 

8/7 

.  109 

.229 

.  344 

.429 

.461 

L 
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Table  9  (Continued). 


n=9 

X 

.  1 

.2 

.  3 

.4 

.5 

i  2/1 

.048 

.045 

.0  30 

.019 

.016 

14/8 

.082 

.  104 

.083 

.055 

.  043 

12/7 

.085 

.116 

.  103 

.078 

.  066 

5/3 

.085 

.  120 

.113 

.095 

.086 

8/5 

.086 

.  122 

.119 

.  105 

.098 

3/2 

.097 

.168 

.194 

.190 

.  184 

4/3 

.  104 

.191 

.243 

.265 

.270 

11/10 

.  108 

.225 

.  337 

.419 

.449 

n=10 

V 

•X 

.  1 

.2 

.  3 

.4 

.5 

2/1 

.044 

.036 

.021 

.012 

.009 

16/9 

.078 

.089 

.063 

.035 

.024 

14/8 

.081 

.  101 

.079 

.050 

.038 

12/7 

.082 

.  104 

.086 

.061 

.050 

5/3 

.082 

.105 

.089 

.068 

.059 

8/5 

.082 

.106 

.093 

.073 

.064 

14/9 

.095 

.153 

.157 

.133 

.119 

4/3 

.  105 

.  200 

.264 

.295 

.  303 

22/21 

.110 

.234 

.  358 

.452 

.487 

n=  1  5 


Table  9  (Continued) 


n=20 


»x. 

.  1 

.2 

.  3 

.4 

.  5 

2/1 

.015 

.004 

.001 

.000 

.000 

36/19 

.042 

.017 

.003 

.000 

.000 

9/5 

.045 

.021 

.005 

.001 

.000 

34/19 

.071 

.048 

.015 

.003 

.001 

22/13 

.074 

.056 

.021 

.005 

.002 

32/19 

.089 

.093 

.043 

.011 

.003 

5/3 

.090 

.  101 

.050 

.014 

.005 

38/24 

.090 

.  103 

.055 

.018 

.008 

30/19 

.097 

.137 

.090 

.033 

.014 

3/2 

.097 

.  146 

.  107 

.049 

.026 

28/19 

.099 

.  170 

.150 

.077 

.042 

16/11 

.099 

.  170 

.150 

.079 

.044 

26/18 

.  100 

.174 

.164 

.093 

.056 

24/17 

.  100 

.175 

.  168 

.  102 

.  066 

136/99 

.100 

.  176 

.  171 

.  108 

.074 

26/19 

.  100 

.189 

.210 

.151 

.109 

4/3 

.  102 

.193 

.223 

.175 

.137 

100/99 

.109 

.220 

.  332 

.442 

.493 
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Since  P(S+  >  s+  p)  Is  a  continuous  function  of  p  ,  approxima- 
n  — 

tions  to  critical  regions  where  p  assumes  a  value  other  than  those 
listed  can  be  made.  For  example,  suppose  n  =  15,  p  =  .  35  and 
S+  =  ~  .  Plotting  P(S^_  >  I  p)  vs.  p  we  find  that 
P(S*  >  I  p  =  .  35)  =  .  04  ;  see  Figure  10,  P(S+  >  s+  I  p)  is  not 
a  continuous  function  in  s"*”  ,  however.  Hence  interpolation  for  a 
significance  test  between  two  s  values  in  the  table  is  hazardous. 

The  values  attained  by  S+  are  not  uniformly  spaced  throughout  any 

n 

interval  in  critical  regions  nor  are  the  probabilities  of  the  values  of 
equal  size. 

The  following  interesting  properties  (A)  and  (B)  of 
P(S^  >s  I  p)  are  suggested  from  computations. 

f=  0  for  p  =  0,  1 

(A)  For  s  >1  and  any  n  ,  P(S  >  s  |  p)  < 

1  >0  for  pe(0,  1) 


and  either  (Al)  two  local  maximums  occur  at  say  p'  and  1 -p' 
where  p'  ^  and  a  local  minimum  occurs  at  p  =  i  or  (A 2)  a  maxi 
mum  occurs  at  p  =  ^  • 


(B)  For  s+<  1  and  any  n  ,  P(S+  >  s+  p) 

n 


=  1  for  p  =  0,  1 
<1  for  pe(0,  1) 


and  either  (Bl)  two  local  minimums  occur  at  say  p1  and  1-p' 
where  p '  4  j  and  a  local  maximum  occurs  at  p  =  —  or  (B2)  a  mini 
mum  occurs  at  P  -  j  • 
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n  =  15  :  s"*"  <  —•  -  .  6 

b 

n  =  20  :  s+<  =  .  571 

-  14 

1  + 

Once  the  minimum  occurs  at  p  =  ^  for  an  s  value,  it  occurs  at 
p  =  ~  for  all  remaining  smaller  values  s+  <  1  . 

Graphically,  P(S*  >  s+  |  p)  vs.  p  is  changing  as  s+  de¬ 
creases  from  2  to  0  in  the  following  way. 


This  behavior  and  Theorem  11  suggest  that  the  conjecture  following 
that  theorem  might  be  true. 

The  next  two  theorems  investigate  the  behavior  of  the  test 
statistic  as  n — >  n+1  . 

Theorem  12.  S"*"  (X , ,  •  •  • ,  X  )  <  S  .  (X . ,  •  •  • ,  '.C  ,  -1)  for  all  n. 

- - n  1  n  n+1  1  n 

+  1  J  1  n 

Proof.  Suppose  S  (X  ,  •  •  • ,  X  )  =  -r  £  X. - —  S  X.  where 

-  n  1  n1  J  i=1  i  n-j  .=j+1  i 

j  f  0,  n  .  Since  n+l-j  >  n-j  and 
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n+1  n  n 

2  X  =  2  X.  -  1  <  2  X.  we  have 

i=j+I  i=j+l  1  i=j  1 


n+i  !  n 

£  X.  <  -±r  2  X. 

n+1 -J  ■  ■  . ,  i  n-i  .  . , ,  i 

i=j+l  J  i=j+l 


i-  2  X.  - 


i=l  1  n'J  i=j+l 


2  X.  <  £  2  X  -  — i — 

1  J  i=1  1  n+1"j 


n+1 
2  X. 
i=j+l  1 


=^Sn(XV'-Xn>  <SL,  — 

For  those  sequence 8  X  ,  *  •  • ,  Xr  for  which  S+(X),*»*,X  )  = 

1  n  i  n  i  n+1  * 

-  ~  Z  x;  we  have  *  —  2  x  <  -  — — •  2  X.  .  For  those 

"  i  =  l  1  n  i=l  1  n+1  i=l  1 

1  1  ^ 

sequences  X  ...,X  for  which  S(X,,«*.,X)  =  -  2  X  we  have 

»  n  n  i  n  n  i  j  1 

J,  xi - f  «-*>  * 


Theorem  1 3.  ^(X^  •  •  ♦ ,  X^)  >  * ' "  >  xn»  1)  except  possibly 

"h“  Sn+l‘XI-— •X„->>  =  KTT  Xi  <  1  • 


+  i  j  i  n+ 1 

Proof.  Suppose  S  (X  •  •  • ,  X  ,  1 )  =  i-  2  X. - —  2  X 

n+i  1  n  J  .  .  l  n+l-i  .  .  i 

»=1  t=j+I 


for  some  j  ^  0,  n,  n  +  1  .  Since  2  X.  <  n-j  we  have 

i=j+l  1  " 


{(n+1  -j)  -  (n-j)}  2  X.  <  n-j 

i=j+l  1 
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n 

n 

2  X. 

« 

<  (n-j)  2  X.  + 

i=j+i 

i=j+l  1 

n 

n+1 

2  X. 

i 

<  (n-j)  2  X. 

i=j+i 

i=j+l  1 

n+1 


1  J  1 
2  X.  - 


n+1  j  n 

2  X.  <  r-  2  X. _ _  2 


>  ui  1  "+i'i  uj+i  ‘-J  U1  >  »-i  i.j+1 

=*‘<+i<3V  —  x»-"  i  S>r—-V 

for  those  sequences  where  S+  , (X,,  •  •  ■ ,  X  ,  1 )  does  not  occur  at 

n+1  1  n 

j  =  0,  n,  n+1  . 

Suppose  S^+1  occurs  at  j  =  0;  then 


-n  <  -  2  X. 

~  1=1  1 


n  n  n 

-n  2  X.  -  n  <  -n  2  X.  -  2  X. 

1=1  1  1=1  1  1=1  1 


-n  \  2  X.  +  1  \  <  -(n+1)  2  X. 

i=l  1  I  i=l  1 


n+1  n 

-n  2  X.  <  -(n+1)  2  X. 


i  =  l 

1 

i  =  l 

=£*  — L_ 

n+1 

n+1 

2  X. 
1=1  1 

1  n 
<  -  -  2 

"  n  1=1 
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=>  S+  .  (X  ,  •  •  • ,  X  ,  1 )  <  S+(X, ,  •  •  • ,  X  )  . 
n+1  1  ’  n  -  n'  1'  ’  n' 

S+  (X  •  •  •  X  ,1)  cannot  occur  at  j  =  n  .  See  Theorem  10. 
n+1  1  n 


Study  of  the  computations  for  P(S4  >s  +  j  p)  suggests  that 

for  all  n  and  p  and  useful  c<  levels  it  is  usually  true  that 

p)  ;  when  the  inequality  is 


P  (S+  >  s+ 
n  — 


?)  >  P(Sr+..  >  s  + 


n+1  - 

not  true  the  difference  between  the  probabilities  is  quite  small. 

Thus  if  S^(x^,  •  •  • ,  x^)  is  in  the  critical  region  for  a  size-**  test 

then  i(xi«  *  *  *  *  xn»  -1)  is  in  the  critical  region  for  an  approximate 

size-X  test  by  Theorem  12.  For  the  exceptional  sequences  in  Theorem 

1 3,  S+  (x  •  •  • ,  x  ,  1 )  <  1  and  these  sequences  are  not  in  the  critical 
n+1  1  n 

region  for  any  reasonable  size-  o<  test  for  any  n  .  For  those  sequences 

for  which  the  inequality  in  Theorem  13  is  true,  we  cannot  say  in  general 

that  •  •  • ,  xn>  1)  is  not  in  the  critical  region  for  an  approximate 

size-«  test  when  S+(x  ,  ,  x  )  is  not  in  the  critical  region  for  a 

n  1  n 

size-<X  test.  One  must  consider  more  closely  the  particular  sequence 

x  ,  •  •  • ,  x  and  the  p robability  p  . 

In 

More  generally,  computation  indicates  that  for  n>20  we  can  test 

H  vs.  as  follows.  Suppose  S+  =  s+  :  choose  the  largest  value 

0  1  n  n  6 

s+  of  S*  so  that  s+<  s+.  If  s4  is  in  the  critical  region  for  a 
20  ~  n  ® 

size-<X  test  for  sample  size  20  then  s^  will  be  in  the  critical  region 
for  an  approximate  size-cx  test  for  sample  size  n  . 
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.  l  j  1  n  I 

Theorem  15.  If  S  (X. .  •  •  • ,  X  )  =  |  -  2  X.  -  — r  2  X.  | 

-  n  1  n  |  i  .  ,  i  n-i  .  .  ,  i  ■ 

J  i=l  J  i=j+l 

and  j  i  0,  n  ,  then  X .  =  -X . . ,  . 

J  ’  J  J+l 

Hence  to  evaluate  S  for  a  sample  sequence  one  need  only 
n 


I  1  r  1  n 

calculate  —  2  x.  -  -  2  x. 

I  r  .  .  i  n-r  .  .  l 

i=l  i=r+l 


for  those  r  for  which 


x  =  -x  .  ,  and  if  each  value  calculated  is  less  than  1  then  also 
r  r+1 

for  r  =  0  . 


For  n  =  1 -10,  15,  20,  o(0_c  )  vs.  n  is  plotted  in  Figure  1 1. 

n 


As  with  o(0.g+)  ,  o<ns  )  increases  rapidly  with  n.  log^t  <*ns » 
n  n  n 

vs.  log2n  is  plotted  in  Figure  12.  For  n  =  4-10,  15,  20  the  graph  is 
almost  a  straight  line.  The  best  least  squares  straight  line  fitting 
these  points  is  log,(o(/l_Q  ))  =  2.  3221  log2n  -  3.0805  so  an  approxi- 

S 


/PI  i  “  .  .  ,r\  s  ,-3.0805  2.  3221 

mation  to  o(I  j.g  )  may  be  found  by  o(Wg  )  =2  n 

n  n 


In  Section  2.3  we  found  an  approximation  to  o(_0.g+)  as 


n 

o(/l.s+)  =  2  2,456  n2'2^6  and  since  o(Q  g+)  >o(flg  )  for  n>2, 
n  n  n 

one  cannot  approximate  the  orders  of  the  sample  spaces  for  n  >  20 

with  confidence  using  these  formulas. 

We  next  investigate  the  behavior  of  the  .probability  density 

and  its  mode,  mean,  variance  and  median  for  varying  n  and  p  .  In 

Figures  13.1,  13.2  are  graphed  P(Sn  =  s  |  p)  vs.  s  for  n  =  10  and 

20,  for  p  =  .  1,  .2,  .3,  .4,  .5  and  for  those  s  for  which 


PRECEDING  PAGE  BLANK-N0T  FILMED 
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Figure  11.  °(Ag  )  va.  n 
n 


L 
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P(^n  -  s  |  p)  >  .01  .  By  Theorem  6  the  probability  densities  for 
p  =  .  6,  .7,  .8,  .9  are  the  same  as  those  for  1 -p  and  hence  are 
omitted. 

For  each  n  plotted  as  p:  .1 - *.5  the  number  of  "large" 

probability  mas ses  are  shifting  away  from  extreme  s  values.  The 

same  phenomenon  occurs  for  S  for  large  extreme  values. 

The  mode  of  S  behaves  as  the  mode  of  S+  in  regard  to  the 
n  n 

shifting  through  s  values  and  the  change  in  the  mode's  probability 
as  p  varies. 

Figure  14  illustrates  the  behavior  of  E(S  I  p)  for  n  =  5-10 

n  f  r  ’ 

15,20  as  p  varies.  The  mean  is  symmetrical  about  p  =  —  for  all 

n  due  to  Theorem  6  and  E(Sn  |  p  =  0)  =  1  .  For  each  n  plotted 

E(Sn  |  p)  increases  as  p:  0 — »  and  for  each  p,  E(Sn  |  p)  is 

a  decreasing  function  of  n  .  Hence  the  range  of  expected  values 

over  p  for  given  n  decreases  as  n  increases;  for  large  n  and  p 

near  —  the  expected  values  are  fairly  constant. 

In  comparing  E(Sn  |  p)  to  E(S*  j  p)  for  n  =  5-10,  15,  20 

note  that  both  have  at  least  local  maximums  at  p  =  —  and  both  in 

2 

general  are  decreasing  functions  in  n  for  fixed  p  .  Also  note  that 
E(Sn  |  P)  >  E(S^  |  P)  *or  a^  P  and  n  . 

Var  (Sn  |  p)  vs.  p  is  sketched  in  Figure  15  for  n  =  5-10,  15,  20; 
Var  (Sn  |  p)  =  Var  (Sn  |  1  -p)  Vp,  n  and  Var  (S  |  p  =  0)  =  0V  n  . 

For  each  n  plotted  local  maximums  occur  at  approximately  p  =  .2 


/ 

'Jr 
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and  .8  and  a  local  minimum  occurs  at  p  =  —  ;  such  a  behavior  is 
expected  from  the  shift  in  probabilities  for  S  as  p:  0  — *•  i  .  The 

XI  w 

variance  depends  heavily  on  p  and  for  large  n  is  quite  small  for  p 
near  j  •  For  *11  n  the  variance  is  approximately  the  same  for 
fixed  p,  where  p<  .2  and  p  >.8  . 

In  Figure  16  is  graphed  med  (S  I  p)  vs.  p  for  n  =  5,  10,  15, 

n  1 

20.  The  median  is  symmetric  in  p  and  1-p  ;  med  (S  p  =  0)=  1 
for  all  n  and  maximums  occur  at  p  =  ^  for  each  n  plotted;  E(Sn  j  p) 
behaves  similarly.  For  larger  n  however  the  median  decreases  as  p 
increases  from  0  and  then  increases  as  p  nears  i ;  the  mode 
exhibited  this  behavior. 

Since  med  (S^  |  p)  >  med  {S*  |  p)  for  all  n  and  p  , 
med  (S  |  p=4  )  >  1  for  all  n  by  Theorem  11,  It  is  tempting  to  con- 
jecture  that  med  (S^  |  P  =  g- )  >  xned  (S^  |  p)  for  all  p  and  n  . 

Critical  regions  for  the  test  vs.  are  contained  in 
Table  17  for  n  =  5,  10,  15,  20  and  p  =  ,  1,  .2,  .3,  .4,  .5;  critical 
regions  for  1-p  are  available  from  the  table  by  Theorem  6.  As  with 
the  test  Hq  vs.  h|  interpolation  can  be  used  to  find  critical  regions 
where  p  assumes  a  value  other  than  those  listed  in  the  tables. 

Again  interpolation  for  a  significance  test  between  two  s  values  in 
the  table  is  hazardous. 

For  fixed  n  and  s  ,  P(Sn  >  s  |  p)  as  a  function  of  p  behaves 
as  did  P(S+  >  s+  Ip).  Properties  (A)  and  (B)  on  page  70  are 
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Table  17  (Continued). 


n=10 

X 

.1 

.2 

.  3 

.4 

.5 

2/1 

.087 

.072 

.042 

.024 

.018 

16/9 

.152 

.172 

.121 

.067 

.047 

14/8 

.158 

.196 

.153 

.097 

.074 

12/7 

.159 

.201 

.166 

.118 

.098 

8/5 

.160 

.206 

.  180 

.143 

.127 

14/9 

.182 

.287 

.291 

.246 

.221 

22/21 

.208 

.419 

.605 

.735 

.781 

n=l  5 

X 

.1 

.2 

.3 

.4 

.5 

2/1 

.051 

.023 

.007 

.002 

.001 

26/14 

.115 

.078 

.029 

.007 

.002 

24/13 

.122 

.092 

,038 

.011 

.004 

14/8 

.123 

.096 

.044 

.016 

.009 

12/7 

.163 

.174 

.098 

.037 

.019 

22/13 

.167 

.192 

.119 

.050 

.027 

5/3 

.167 

.196 

.127 

.058 

.034 

16/10 

.168 

.198 

.132 

.067 

.044 

11/7 

.183 

.266 

.211 

.115 

.074 

14/9 

.183 

.266 

.212 

.117 

.077 

20/13 

.  185 

.280 

.241 

.147 

.103 

41/28 

.185 

.285 

.  .254 

.  170 

.128 

13/9 

.185 

.286 

.260 

.182 

.  144 

20/14 

.189 

.  326 

.341 

.263 

.215 

14/13 

.205 

.  399 

.576 

.715 

.770 
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Table  17  (Continued). 


n=20 

\P 

s\ 

.1 

.2 

.3 

.4 

2/1 

.030 

.008 

.001 

.000 

36/19 

.083 

.033 

.006 

.001 

34/18 

.089 

.039 

.008 

.001 

9/5 

.089 

.041 

.010 

.002 

34/19 

.137 

.093 

.028 

.005 

16/9 

,142 

.105 

.035 

.007 

22/13 

.143 

.108 

.040 

.010 

32/19 

.  170 

.175 

.080 

.020 

5/3 

.173 

.190 

.095 

.027 

28/17 

.173 

.193 

.  101 

.031 

38/24 

.173 

.194 

.  104 

.035 

30/19 

.  184 

.254 

.165 

.061 

142/91 

.184 

.254 

.  165 

.062 

20/13 

.185 

.267 

.  189 

.078 

3/2 

.186 

.271 

.199 

.092 

28/19 

.189 

.311 

.270 

.139 

16/11 

.189 

.311 

.271 

.142 

26/18 

.  189 

.320 

.296 

.169 

140/99 

.189 

.320 

.297 

.172 

24/17 

.189 

.321 

.305 

.187 

136/99 

.190 

.  323 

.310 

.199 

26/19 

.191 

.344 

.372 

.266 

22/21 

.206 

.396 

.563 

.709 
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satisfied  by  S  .  For  s  near  2  (Al)  is  satisfied,  as  s  approaches 
n 

1  (A2)  is  satisfied,  and  if  is  the  largest  value  for  which  (A2)  is 
satisfied  then  for  each  s  ,  s^  <  s  <  1  ,  (A2)  is  satisfied.  For  s  <  1 


either  (Bl)  or  (B2)  is  satisfied. 

For  very  large  s  we  have  that  P(Sn  >  s  |  p)  =  2P(S*  >  s  |  p). 

One  expects  near  equality  for  large  s  since  if  X^,  has  = 

S+  =  s  then  X  ,  •••,X,  also  has  S  =  s  .  The  equality  breaks  down 
n  n*  1  n 

when  X,,»««,X  =  X  ,«**,X  . 

1  n  n  l 

Study  of  computations  suggests  that  for  all  n  and  p  and  useful 
levels  it  is  usually  true  that  P(S^  >  s  |  p)  >  P(Sn+^  ^  s  J  p);  when 
the  inequality  is  not  true  the  difference  between  probabilities  is  quite 
small.  Tests  for  n  >  20  can  be  obtained  by  using  this  fact. 

■|>i  ^ 

2.5  The  Covariance  of  S  and  R 

n  n 

Recall  that  in  general  for  each  value  s+e-0.„+  there  are  only 

n 

two  points  j,  n-jeO^+  at  which  the  value  is  assumed;  exceptions 

+  n  + 
occur  for  s  =2  and  a  few  more  "popular"  values  such  as  s  =1,  0. 

A  measure  for  further  investigating  the  relationship  between  S*  and 

R*  is  provided  by  dieir  covariance.  Since  p  is  estimated  only  when 

the  null  hypothesis  is  rejected,  the  covariance  of  S+  and  R+  given 

n  n 

+  + 

that  Sn  >  s  is  of  particular  interest.  Theorem  17  describes  the 


behavior  of  the  conditional  covariance;  Lemma  16  will  be  useful. 
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Lemma  16.  For  each  n,  pe(0,  1)  and  s+,  E(R+  Ip,  S+  >s+) 

.  n  I  n  — 

+  E(R*  j  1  -p»  »+)  *  n;  for  each  pc  Jo,  and  s+  >  1,  E(R*  j  P. 

S+>  e+)  <_E(R+  |  1-p,  S+  >  s+);  for  each  n  and  s+,  E(R+  |p  =  0, 
n  —  n  I  n  —  n  1 

S+  >  s+)  =  0  . 

n  — 

Proof.  Note  that 

E(R^  |  p,  S+  >  s+)  +  E(R+  |  1-p,  S+  >  s+) 
n  >  n  n  f  n 

=  E  j{P(R+  =  j  jp.  S+  >8+)  +  P(R+  =  j  |  1-p,  S+  >s+)} 

j=0 

=  2  j  {P(R^  =  j  I  P.  S+  >  8+)  +  P(R+  =  n-j  Ip,  S+  >  s+)} 

.  A  n  i  n  n  i  n 

j=0 

=  E  {j  P(R+  =  j  |  p,  S+  >  8+)  +  (n-j)  P(R+  =  j  |  p,  S+  >  8+)> 
j=0 

=  n  E  P(R+  =  j  Ip,  S+>  8+)  =  n  . 
j=0  n  ' 

Let  d^(j  |  8+)  denote  the  number  of  sequences  with  k  l's, 

S*  >  s+  and  R*  =  j  .  It  can  be  shown  algebraically  that  E  d*(j  |  s+) 

1  +  ,n+  +  j=0 

>.  E  d^(n-j  I  s  )  for  each  s  >1  and  for  each  i  and  k  where 
j=0 

0<i<.^j  and  0<k<£jJ.  Hence 

E  E  (n-2j) 
j=0  k=0 

>  0 


K'i  I 


,+>  -  d+(n-j  |  /)]  [ 


k.,  .n-k  ,k  n- 
p  (1-p)  -(1-p)  p 
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for  pe(0,  —  )  and  s+  >  1  ;  this  implies  that 
n  n 

S  j  P(R+  =  j  I  1-p,  S+  >s+)  >  Z  j  P(R+  =  j  I  p.  S+  >s+)  . 
.  n  i  n  .  ^  n  i  n 

3=0  3=0 


It  is  quite  likely  true  that  E(R+  Ip,  S+  >  s+)  <  E(R+  I  1-p, 

n  >  n  n  I 

S+>  s+)  for  all  s+,  n  and  pe(0,  ~  )  ;  the  proof  would  involve  more 
complicated  algebraic  manipulations  than  for  the  situation  where  s+  >  1. 


Theorem  17.  For  each  n,  p  and  s+,  Cov  (S+,  R+  |  p,  S+  >s+)  + 
-  n  n  1  n  — 

Cov  (S+,  R*  J  1-p,  S*  >s+)  =  0  ;  for  each  n  and  p  ,  Cov  (S*,  R*  |  p, 

S+  =  2)  =  0  and  for  each  n  and  s+,  Cov  (S+,  R+  I  p  =  0,  S+  >  s+)  =  0  . 
n  n  n  i  n  — 


Proof.  Observe  that 

E(S+  R+  Ip,  sSs+) 
n  n  i  n  — 


S  s+ j  P((S+,R+)  =  (s+,j)  |p,S+>8+) 

8+.3 

S  s+jP((S+RV(s+,j)  |l-p,sSs+) 

^  n  n  1  n  — 

8  J 

S  {ne  +  P((S+R+)  =  (s+,j)  |l-p.S+>s+) 

8+»  j 

(n-j)  s+  P«S+,  R+)  =  (8+,  j)  J  1  -p,  S+  >  s+) } 
n  E(S^  |  1  -p,  S+  >  s+)  -  E(S+  R*  1 1  -p. 
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Also  observe  that  E(S+  Ip,  S+  >  s+)  =  E(S+  I  1 -p,  S+  >  s+)  for  all 

n  i  n  —  n  *  n  — 

n,  p  and  s+.  Apply  Lemma  16.  H 


For  s+  =  2  the  conditional  covariance  of  S+  and  R+  is  zero: 

n  n 

for  s  <  2  computation  indicates  that  the  conditional  correlation  may 

be  positive  or  negative  for  pe(0,  ^-)  .  More  specifically  for  pe(0,  —  ) 

the  correlation  is  negative  for  s'*",  1  <  s+<  2  ,  and  is  positive  when 

s+  is  quite  small.  The  magnitude  of  the  correlation  of  S+  and  R+ 

n  n 

for  given  p  and  s+  is  not  large.  Verification  of  this  behavior  of  the 
correlation  for  arbitrary  n  and  given  p  and  s+  appears  to  be 
intractable. 


2. 6  The  Covariance  of  S  and  R 

n  n 

The  conditional  covariance  of  S  and  R  given  that  S  >  s  is 

n  n  n  — 

identically  equal  to  zero  for  all  n,  p  and  s;  and  R^  are  definitely 
not  independent  random  variables  however.  Lemma  18  indicates  that 
the  conditional  mean  of  R^  does  not  depend  on  p  or  s  . 


Lemma  18.  For  each  n,  pe(0,  1)  and  s,  E{R  p,  S  >s)  =  — 
■  n  n  —  2 


Proof.  E(R  I  p,  S  >  s)  =  £  i  P(R  =  i  I  d.  S  >  e) 

-  n  I  n  —  .  „  n  I  r  n  - 

3=0 

=  \  £  {j  P(Rn  =  j  |P,  Sn>s) 


+  (n-j)  P(R  =  n-j  Ip,  S  >  s)} 
n  •  n  — 
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-  J  ^{ip(Rn  =  j|p.  «„>•) 

+  (n-j)  P(Rn  =  j  |  P»  Sn  >  s)} 
n 


Theorem  19.  For  each  n,  p  and  s,  Cov  (S^,  R^  |  p,  >  s)  =  0  . 
Proof.  Note  that 


E(S  R  p,  S  >  a)  = 
n  n  n 


2  aj  P((S.  R  )  =  (a,  j)  I  p,  s  >  a) 
n  n  f  n 

*»  j 


=  2.  S  a  P((S  ,  R  )  =  (a.  j)  I  p,  S  >  a) 

2  n  n  In 

=  ~  2  a  P(S  =  s  I  p,  S  >  a) 

Z  n  i  n 

a 

*  I  I  P-  S„  >  *>  • 


Apply  Lemma  18. 


2.7  Properties  of  R* 

Study  of  the  change  point  random  variable  R*  provides  proper¬ 
ties  for  the  estimator  p  of  the  change  point.  Although  R+  is  of 
interest  in  its  own  right,  note  that  p  is  estimated  only  when  is 
rejected  as  a  study  of  the  random  variable  =  (R+  I  S+>s"1’  )  is 

of  primary  concern.  In  this  section  properties  of  both  random 
variables  are  investigated.  First  we  examine  the  behavior  of  the 


A 


■5 
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density  of  R+  and  its  mean  and  variance  as  functions  of  n  and  p  . 

Next  p*  ^  is  investigated  and  a  comparison  is  made  between  ^ 
and  the  corresponding  change  point  random  variables  for  the  generalized 
likelihood  ratio  test  statistic  (1.4.1)  and  Page's  test  statistic  (1.4.2). 
The  section  is  concluded  proposing  an  alternative  to  randomizing  for 
estimating  p  when  is  rejected  and  when  is  not  uniquely 


defined  by  considering  what  happens  to  as  n- 


n+1  . 


The  behavior  of  the  densities  of  R  is  illustrated  in  Figures 

n 

18.1,  18.2  for  n  =  10,  20  and  p  =  .l,  .2,  .3,  .4,  .5.  The  densities 


for  p  =  .6,  .7,  .8,  .  9  are  related  to  those  for  1  -p  by  P(R^  =  j  |p)  a 
P(R+  =  n-j  I  1-p)  according  to  Theorem  2.  We  examine  the  den¬ 


sities  separately  for  R+  =  0  and  R+e  {!,*••,  f^l}  . 


n 

^  1& 

The  sequences  for  which  Rr  =  0  are  of  the  type  T^(0,  0)  where 

k  <  j^J  so  P(R*  -  0  f  p )  =1,0  when  p  =  0,  1  respectively.  For 

sequences  of  this  type  it  is  easy  to  show  that  for  each  n,P(R*  =  0  |  p) 

decreases  as  p  increases  at  least  for  pe^j-.lj  >  ft  appears  that  this 

is  also  the  tendency  for  pe(0,  j  )  .  For  each  p,  P(R+  =  0  j  p)  tends 

to  decrease  as  n  increases.  The  proportion  of  sequences  for  which 

R+  =  0  is  decreasing  with  n  ;  note  that  .091  of  the  2^^  sequences 
11 

have  R*  =  0  and  about  .  042  of  the  2^  sequence  have  R+  =  0  . 

1 U 

For  je  {l,  •  •  * ,  [■“'T']  )  there  are  in  general  more  sequences  for 
4-  + 

which  R  =  j  then  R  =  j+1  .  To  gain  insight  as  to  why  this  is  the 
n  n 

case  consider  those  sequences  for  which  R+  =  1  and  R+  =  2  .  If 

n  n 
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R*  =  1  then  =  1,  x^  =  -1  and  there  may  be  k  I's  in  the  last 

n-2  places,  0  <  k<  .^.n~2ij  ;  if  R+  =  2  then  =  1  or  -1, 

x^  s  1,  x  =  -1  and  there  maybe  k  l's  in  the  last  n-3  places 

where  0<k<  [— ■  ]  if  Xj  =  1  and  0<k<  [SZ^J  if 

x^  =  -1  (see  Theorems  3  and  10).  Thus  the  total  number  of 

sequences  for  which  S+  may  occur  at  j  =  1  is  larger  than  for 

n  ° 

j  =  2  .  Not  all  of  the  sequences  described  actually  have  R+  =  1  or  2 

n 

however.  Forgiven  k  if  many  of  the  l's  are  near  the  beginning  of 

the  sequence  S*  may  occur  at  some  larger  point  j  ;  note  that  the 

sequences  for  which  x^  =  -1,  =  1,  x  =  -1  are  more  sensitive  in 

this  situation  and  are  more  likely  to  have  S*  occurring  j  >  2  . 

Observe  next  that  for  those  sequences  for  which  R*  =  j  there 

generally  are  a  larger  proportion  of  l's  than  -l's  in  Xj,*»*,Xj 

and  a  larger  proportion  of  -l's  than  l's  in  x..,,««»,x  since 

j+1  n 

S*  cannot  be  negative.  Hence  for  p  near  C  those  points  near  j  =  1 

are  most  probable,  for  p  near  1  those  points  near  j  =  n-1  are 

most  probable  and  for  p  near  —  the  extreme  points  j  encompass 

2 

most  of  the  probability  mass. 

Thus  for  je{l,  ••  •,  n-l}  and  for  given  p  the  shape  of  the  prob¬ 
ability  densities  will  be  similar  for  abitrary  n.  Moreover  calculation 

indicates  that  for  the  more  probable  extreme  points  j,  P(R+  =  j  I  p) 

“l 

=  P(Rn  *  j  j  p)  for  all  ttj,  n^  and  p  .  Among  the  less  probable 
2 

points  j  the  remaining  probability  mass  is  similarly  distributed  for 


105 


each  n  and  p  ;  the  probability  of  each  of  these  points  decreases  as 
n  increases  for  each  p  . 

Figure  19  displays  E(R*  |  P)  vs.  p  for  n  =  5,  10,  15,  20.  For 
each  n  the  mean  is  an  increasing  function  in  p  and  for  each  p  is 
an  increasing  function  in  n  ;  in  fact  E(R*  |  p)  =  np  for  each  n  and 
p  .  Lemma  16  describes  the  relationship  between  the  means  at  p 
and  1  -p  . 

Var  (R+  I  p)  vs.  p  for  n  =  5,  10,  15,  20  is  plotted  in  Figure  20. 
The  variance  is  symmetric  in  p  and  1  -p  ,  is  zero  for  p  =  0,  and 
is  maximized  for  p  s  r-  for  all  n  as  is  expected  from  the  manner 
in  which  the  probability  masses  shift  as  p  varies.  For  fixed  p  the 
variance  increases  with  n  and  for  each  n  the  variance  increases  as 


4 

It  will  be  seen  next  that  many  of  the  properties  of  R^  and  the 

conditional  random  variable  =  (R+  I  S+  >  s+  )  are  similar. 

n  1  n  —  p 

Note  however  that  R*c{0,  ••♦,n}  whereas  we  would  like  p  to  satis¬ 
fy  0  <  p  <  n  .  We  have  assumed  that  a  change  in  the  model  may  have 
occurred  only  after  sampling  began,  that  is,  p>0,  and  p  is  esti¬ 
mated  only  when  the  null  hypothesis  that  p  =  n  is  rejected.  Observe 
that  the  estimator  p  is  consistent  with  both  criteria:  the  largest  value 
of  S*  for  which  R*  =  0,  n  is  s+  =  1  ;  for  each  n  and  p,  P(S*  >  1  |  p) 
is  very  large  and  this  s+  will  not  be  in  the  critical  region  for  any 
reasonable  size- «  test.  Hence  0<p<n  as  desired. 
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Each  je  {l,**»,n-l}  is  a  possible  estimator  for  p  when 

pk(l-p)n"k  since  for  s+  =  2,  P((sJ  ,rV  (2,j)  I  p)  = 
k=l  n  n  1 

p^(l-p)n  ^  .  Thus  all  possible  change  points  have  positive  probability 

of  being  detected  when  is  rejected.  As  will  be  shown  in  Section 

2.9  the  next  j^J  largest  values  attained  by  S*  in  decreasing  order 

occur  at  j  and  n-j,  j=l,  •••,  respectively. 

In  contrast  to  the  estimator  p  for  the  test  statistic  S+  ,  con- 
sider  the  estimators  for  the  change  point  for  the  generalized  likeli- 
hook  ratio  test  statistic  (1.4.1)  and  Page's  test  statistic  (1.4.2).  For 
each  of  these  statistics  when  the  null  hypothesis  that  no  change  has 
occurred  is  rejected,  p  =  0  is  a  possible  estimator  for  the  change 
point;  in  fact  for  each  test  the  most  extreme  values  in  the  critical 
regions  correspond  to  a  change  in  the  model  occurring  before  sampling 
began.  For  the  generalized  likelihood  ratio  test  each  p  <  n-1  is  a 
possible  estimator  for  p  when  p  is  small  and  q  is  large;  when  q 
is  not  large,  however,  recent  change  points  cannot  be  detected.  More¬ 
over  recent  change  points  cannot  be  detected  as  p  increases  to  larger 
values.  Page's  test  statistic  behaves  similarly  and  for  fixed  p  is 
even  less  able  to  detect  changes  late  in  the  sequence  than  the  general¬ 
ized  likelihood  ratio  test  statistic. 

Next  the  probability  densities  for  p  +  are  examined.  The 

form  of  the  densities  are  the  same  as  those  for  R  :  for  the  same 

n 

reasons  it  is  also  true  for  p  +  that  when  p  is  near  0  those  points 

n, 
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near  j  =  1  are  moat  probable  and  when  p  ia  near  ~  the  extreme 

it 

points  are  moat  probable;  the  denaitiea  for  p  and  1-p  are  related 
by  P(p*  u  -  j  |  P)  =  ^  =  n"j  |  1"P)«  As  a  function  of  s+,  when 


n,  cx 


s+  decreases  to  the  next  smaller  value,  aay  s  +  ,  then  P(p+  =  j  Ip) 

7  p  r  n/X  1 

increases  for  those  points  j  at  which  s  +  occurs  and  decreases  for 

«*•  P 

all  other  points  j  . 

For  n  =  10  and  p:,l,  .3,  .5,  Table  21  lists  P(p*  _  =  j  |  p) 

XVpCK  1 

for  j  =  1,  *  ••,  9  ,  and  the  six  largest  values  of  and  the  smallest 

22  + 

value  —  of  which  is  greater  than  1  ;  for  each  p  the  last  line 

displays  P(R*0  =  j  |  p)  for  comparison.  For  many  values  of  S*Q 
and  for  p  sufficiently  small  the  more  probable  extreme  points  near 
j  =  1  encompass  even  more  of  the  probability  mass  for  ^  than 


for  R 


10  * 


+  11 

The  mean  of  j?  is  sketched  in  Figure  22  for  pe(0,  —  and 
10, <X  ZJ 

s+  =  2/1,  16/9,  8/5,  22/21  and  0/1  .  Lemma  16  describes  the 


relationship  between  the  means  at  p  and  1  -p  .  For  each  s  the 
mean  is  tied  down  at  0,  j  when  p  a  0,  ~  ,  respectively.  As  with 
R*  ,  E(p*  |  p)  is  an  increasing  function  in  p  for  each  s+  . 

For  s+  >  1  ,  E(R*  |  p,  S+  £  s+)  >  E(R*  |  p)  when  p  is  near  0  and 


‘10  '  10  '  '10 
E(Rjq  |  P>  S*  >s+)  <  E{R*^  |  P)  when  p  is  near  j  .  Because  of  the 

nature  of  change  in  the  density  of  p*  as  s  +  decreases,  no  mono- 

1 0,X 


tonic  properties  as  functions  in  s  can  be  stated  for  the  mean  of  p 
for  arbitrary  p  • 


10,<* 
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E(Pio.*lp> 


Figure  22. 


Figure  23  displays  the  variance  of  p*  for  pe(0,  X- )  and 

1  Uf  z 

the  same  values  of  s+.  Symmetries  occur  in  p  and  1  -p,  that  is. 


Var  (p^  ^  |  p)  =  Var  (p^  ^  |  1-p)  .  The  variance  is  an  increasing 
function  in  p  for  pe  Jo,  i-j  for  each  s+  .  Again  due  to  the  nature 
of  change  in  the  densities  as  s  +  decreases  no  monotonic  properties 
in  s+  can  be  stated  for  the  variance  of  p*  for  arbitrary  p  . 

X  Of 

The  tendency  though  is  that  as  s  +  decreases  to  the  next  smaller 
value  s+  then  the  variance  decreases  for  p  near  0  and 

«,  p 

increases  for  p  near  4-  when  s+  occurs  nearer  to  j  =  1,  n-1 
than  the  previous  value  of  s+  .  The  opposite  tendency  occurs  when 
s^  p  occurs  nearer  to  j  =  than  for  the  previous  value  of  s+  . 

This  section  is  concluded  by  presenting  an  alternative  to  random¬ 
izing  for  estimating  p  when  is  rejected  and  S+  does  not  occur 
at  a  unique  point.  For  sequences  in  the  critical  region  S+fx^,  •  •  • ,  x^) 
>  1  and  it  can  be  shown  algebraically  that  if  J*(Xj,  ••  •»  x^)  = 

{ j  1  *  *  *  * »  jf)*  *  >  2,  then  J„+1  (xj»  *  *  * »  xn>  xn+1 )  i8  single -valued. 

It  is  somewhat  unexpected  but  in  fact  J+ , ,  (x, ,  •  •  • ,  x  ,  x  . )  = 
r  n+1  1  n  n+1' 

min  j.  whether  xn+^  =  1  or  -1  .  Instead  of  randomizing  then 
i= !,•••,  f 


one  might  as  well  estimate  p  by  min  j.  and  q  using  the  last 

1=1,  •  •  • ,  i  1 

n-  min  j,  observations.  For  sequences  with  S+(x,,-**,x  )<1 
...  i  n  1  n  — 

i=l,  ••  •,! 


*(x  ,  ••*,x  )  =  {j  ,  •••,  j  }  ,  i  >  2,  it  is  not  true 
n  i  n  l  i 


and  J 


WV'-VWV  ,mi”  .  ji  ai  xn+l 

1=1,  •  •  *,  f 


Ill 


figure  23.  Var^J0>oJP)  VB»  P  • 


2.  8  Properties  of  R^ 

In  this  section  the  random  variables  R  and  8  = 

n  rn.  ,c* 

(R  I  S  >  s  )  are  studied.  The  density  of  R  and  its  mean  and 
n  I  n-  p  n 

variance  are  first  examined;  comparisons  are  made  between  R^  and 

R+  .  Next  p  is  investigated;  it  will  be  seen  that  R  and  p 
n  Hn,  <x  6  n  Kn,<x 

behave  similarly. 

The  densities  of  R  are  sketched  in  Figures  24.  1,  24.  2  for 
n 

n  =  10,  20  and  p  =  .  1,  .  2,  .  3,  .  4,  .  5  ;  in  contrast  to  the  densities 
for  R*  note  that  the  densities  for  R^  are  symmetrical  in  j  and  n-j 
and  P(R  =  j  I  p)  =  P(R  =  j  j  1-p)  for  all  n,  p  and  j  by  Theorem  6. 
Observe  the  following  behavior  of  the  densities  as  suggested  by  theory 
and  computation. 

The  sequences  for  which  R^  =  0,  n  are  of  the  types  T^fO,  0), 


Tk(n,  k)  where  k,  n-k  <  if  n 


is  even  or 


k,  n-k  <  ^  n 


is  odd  by  Theorem  7.  For  these  types  of  sequences  the  tendency  is  for 


P(Rn  =  0  |  p)  to  decrease  as  p:  0 — —  .  For  each  p,  P(R^  =  0  |  p) 
tends  to  decrease  as  n  increases  and  the  proportion  of  sequences  with 
R^  =  0  is  decreasing  with  n  .  Observe  that  P(R^  =  0  |  p)  < 

P(R*  =  0  j  p)  for  pe(0,  —  )  ;  this  is  expected  since  =  max  {S*  , 

*Sn  }  * 

For  je  <1,  •  •  • ,  it  appears  to  be  the  case  that  E  d^fi) 

n  L  J  k=0 


For  je  <1,  •  •  • ,  it  appears  to  be  the  case  that  E  d^fi) 

n  L  J  k=0 

>  E  d  (j+1)  ;  in  general  those  sequences  for  which  R  =  j  either 
k-0  k  n 

have  a  larger  proportion  of  l's  than  -l's  in  Xj,**»,Xj  and  a  larger 


H6 


proportion  of  -l's  than  l's  in  xj+2,,,,#Xn  °r  v*ce  versa*  Thus 
the  extreme  points  near  j  =  1,  n-1  are  most  probable.  Also  calcula- 

“on‘  indica“  ^ for  *• more  probable  “,reme  points  j  ■  P(vj  1  p> 

=  P(R  =  j  I  p)  for  each  n  ,  n  and  p  .  The  same  phenomenon 
n2  12 

occurs  for  R  .  Furthermore  note  that  for  each  n,  pe{0,  —  )  and 
n  r  2 

j  ,  P(R*  -  j  |  p)  =  P(R^  =  j  |  p)  ,  as  is  anticipated  from  the 

manner  in  which  S+  and  S  are  defined, 
n  n 

Due  to  the  symmetry  in  j  and  n-j  for  the  density  of  R^, 

E(R  I  p)  =  -j-  and  Var  (R  I  p)  =  Var  (R  I  1 -p)  for  all  n  and  p  . 


Figure  25  displays  Var  (R^  |  p)  for  n  =  5,  10,  15,  20.  Since  opposite 
extreme  points  encompass  most  of  the  probability  mass  the  variance 


is  quite  large.  The  variance  is  maximized  for  p  =  0  where 

Var  (Rn  j  p  =  0)  =  ~~  ;  in  general  the  variance  decreases  as 

p:  0 — ►  ~  for  each  n  and  increases  as  n  increases  for  each  p  . 

In  comparison  with  R*  ,  Var  (R+  |  p)  =  Var  (R^  j  p)  for  .  35  <  p  <  .  65 

and  Var  (R*  |  p)  <<  Var  (Rn  |  p)  for  p  near  0,  1  . 

The  conditional  random  variable  p  is  studied  next.  Note 

rn,* 

that  0<  p  <  n  for  all  reasonable  size-«  tests  and  each  point 
Kn,oc 

je  {l»  •  •  • ,  n-1 }  is  a  possible  estimator  for  p  when  the  null  hypothesis 
is  rejected. 


The  densities  of  ^  ^  for  P  =  •  1,  .3,  .5  and  for  the  six 
largest  values  of  and  the  smallest  value  of  which  is  greater 

than  1  are  listed  in  Table  26;  for  each  p  the  last  line  lists 


¥ 
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Table  26.  =  j|p). 


p=.  1 


3 

j 

& 

j=  1 

j=2 

j=3 

j=4 

j=5 

2/1 

1 . 9 

.0872 

.4444 

.  0494 

.0055 

.0006 

.0001 

16/9 

1,9 

.1517 

.4681 

.0284 

.0032 

.0004 

.0001 

14/8 

2.8 

.1588 

.4483 

.0483 

.0030 

.0003 

.0001 

12/7 

3,7 

.1591 

.446  5 

.0481 

.0050 

.0003 

.0901 

5/3 

4,6 

.1592 

.4462 

.0481 

.0050 

.0007 

.0001 

8/5 

5 

.1595 

.4453 

.0480 

.0050 

.0007 

.0021 

22/21 

3,7 

.2083 

.407  5 

.  0432 

.0365 

.0048 

.0158 

6/10 

0,10 

1.0000 

.0860 

.0663 

.0110 

.0230 

.0291 

p=.3 

S 

j 

<X 

j=l 

j=2 

j=3 

j=4 

j=5 

2/1 

1 . 9 

.0424 

.2862 

.1233 

.0543 

.0266 

.0193 

16/9 

1,9 

.1207 

.4249 

.0433 

.0191 

.0093 

.0068 

14/8 

2,8 

.1528 

.  3355 

.1394 

.0150 

.0074 

.0053 

12/7 

3.7 

.  1664 

.  3082 

.  1281 

.0545 

.0068 

.0049 

5/3 

4,6 

.  1737 

.2951 

.  1227 

.0522 

.0277 

.0047 

8/5 

5 

.1805 

.2841 

.1181 

.0502 

.0266 

.0420 

22/21 

3.7 

.6054 

.2747 

.1052 

.0722 

.0276 

.0406 

6/10 

0.10 

1.0000 

.  1774 

.  1214 

.0598 

.0524 

.0560 

p=.5 

s 

j 

C* 

j=l 

j=2 

j=3 

j=4 

3=5 

2/1 

1.....9 

.0176 

.1111 

.1111 

.  mi 

.1111 

.1111 

16/9 

1,9 

.0469 

.  3542 

.0417 

.0417 

.0417 

.0417 

14/8 

2,8 

.0742 

.2237 

.  2105 

.0263 

.0263 

.0263 

12/7 

3.7 

.0977 

.  1700 

.  1600 

.  1400 

.0202 

.0202 

5/3 

4,6 

.1152 

.  1441 

.1356 

.1186 

.0932 

.0169 

8/5 

5 

.  1270 

.1308 

.1231 

.  1077 

.0846 

.  1077 

22/21 

3,7 

.7813 

.  2263 

.1312 

.0837 

.0375 

.0425 

6/10 

0,10 

1.0000 

.  1927 

.  1465 

.0791 

.0485 

.0449 

t 
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P(R^q  =  j  j  p)  for  comparison.  The  densities  for  Pjq  ^  are  s imila.r 
to  those  for  R^  in  that  the  extreme  points  are  most  probable;  for  p 
near  0  the  extreme  points  j  =  1,  9  are  more  probable  for  p  than 

for  R. 


10 


n 


The  mean  of  p  is  —  for  each  n,  pe(0,  1),  of  .  The  variance 
n,  cK  2 

of  the  conditional  random  variable  is  sketched  in  Figure  27  for 
pe(.  1,.5)  and  s+  =  2/1,  16/9,  8/5,  22/21,  6/10;  symmetries  occur 
in  p  and  1-p  .  Similar  to  the  behavior  of  the  variance  of  R^,  the 
conditional  variance  tends  to  be  large  for  p  small  and  to  decrease  as 

P 

Var  (Rjq  |  P)  •  Note  however  that  Var(RjQ  j  p)  is  maximized  for 
p  a  0  whereas  Var(p 

1  Oj 

the  tendency  is  that  as  s  decreases  to  the 


p  increases  towards  — ;  for  p  sufficiently  small  Var  (p^q  ^ 


p  =  0)  =  0  for  >  1  .  Due  to  the  nature 


of  the  densities  of  p 


10,  of. 


next  smaller  value  s  ;  the  conditional  variance  increases  when 

of.  P 

s  occurs  nearer  to  i  =  l,n-l  than  the  previous  value  of  s  .  For 
of,  p 

p  in  a  neighborhood  of  ,  Var  (Pj+q  ^  |  p)  -  Var  (pjQ  ^  |  P)  and 
Var  (pj+0  ^  |  p)  <<  Var  (p10  ^  |  P)  P  nearer  to  0,  1  . 


2.9  Large  Sample  Derivations 

Expressions  defining  critical  regions  containing  more  than 
+  1  values  of  the  test  statistics  appear  to  be  intangible  for  arbi¬ 
trary  n  .  The  reason  is  apparent  from  the  procedure  for  counting 
employed  in  Section  2.  2.  Although  the  values  attained  by  the  statistics 
as  functions  in  n  are  available  from  the  lists  constructed  in  L  of  the 


Table  28.  (Continue^ 


Table  28.  (Continued) 


Table  28.  (Continued) 


125 


procedure,  one  must  then  be  able  to  order  at  least  the  larger  values 
of  the  statistics  within  each  category  of  a  fixed  number  of  -l's  in 
the  sequence  to  count  the  c^(s+,  j)'s  and  c  (s,  j)'s  as  in  III.  This 
ordering  cannot  be  accomplished  for  arbitrary  n  except  for  the 
most  extreme  values.  Table  28  lists  the  ^J+l  largest  values  of 
the  test  statistics  and  other  pertinent  information  needed  for  calcu¬ 
lating  probabilities.  For  each  p  and  large  n  the  critical  regions 
containing  only  these  values  will  have  very  small  probability. 

Even  though  larger  exact  critical  regions  for  the  tests  for 
arbitrary  n  cannot  be  derived,  observe  from  Table  9  that  for  each 
n  <  20,  P(S*  >l|p  =  .l)  =  .108  and  similar  but  less  emphatic  state¬ 
ments  can  be  made  for  p  >  .  1.  It  appears  as  though  approximations 
to  critical  regions  may  be  obtainable  for  any  n  and  p  ,  but  the 
method  of  proof  for  establishing  stability  properties  and  providing 
bounds  in  the  error  in  approximation  is  evasive  at  present. 

2.  10  Example 

The  following  example  illustrates  a  change  point  testing  situation. 

Measurements  of  the  discharge  of  the  Missouri  River  at  the 
Omaha  Station  are  taken  throughout  the  year  with  the  peak  discharge 
occurring  in  spring.  This  greatest  flow  may  vary  considerably  from 
year  to  year:  when  snowfall  has  been  heavy  and  spring  thaw  quick  the 
peak  discharge  may  be  quite  large.  It  is  the  peak  discharge  that  is  of 


J 
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interest  for  flood  control  purposes.  We  test  for  a  downward  change 
in  the  peak  discharge  during  the  interval  of  years  1945-1964. 

Table  29  lists  the  peak  discharges  y  ,  •••,  y^Q  for  the  years 
1945-1964.  Since  the  data  are  so  variable  the  median  of  the  peak  dis¬ 
charge  is  used  as  a  measure  of  central  location  for  constructing  the 
test.  An  estimate  m  of  the  median  under  the  null  hypothesis  is 
found  from  the  peak  discharges  for  the  years  1930-1940  as  listed  in  the 
same  table;  we  find  m  =  85,  600  cubic  feet/second.  Define 


1  if  y.  >  85,  600 


-1  if  y.  <  85,  600 


,  i  =  1,  •  ♦  20,  and  for  these  transformed  data 


test  Hq:  p  =  20,  p  =  j  vs.  H*  :  p  <  20,  p  =  ~ ,  q,  p  >  q  .  Using  the 
x.1  s  as  computed  in  Table  29  we  find  that  S^p  =  and  R*Q  =  9  . 


Now  P(S*  > 

'20—99 


p  =  —  )  =  .057  and  when  Hp  is  rejected  at  this 


significance  level,  a  decrease  in  the  median  peak  discharge  is  esti¬ 
mated  to  have  occurred  in  1954.  River  flow  controls  were  incidently 
instigated  upstream  from  Omaha  at  this  time;  the  change  point  test 
detected  a  simultaneous  decrease  in  peak  discharge. 

Note  that  the  change  point  test  detects  a  decrease  in  peak  dis¬ 
charge  of  the  river  with  fewer  than  20  observations.  If  for  each  set 


of  15  consecutive  years  between  1945  and  1964  one  calculates  S  , 

I  5 

one  finds  that  S+,.  (x^  •  •  • ,  ^  ,  S+5  (*2>  •  • .,  x^)  ' 

^15  *3'  ***'X17  =  4  '  S15,V--”l8,  =  T-  S15  ( V  ’ '  •  x19)  =  5 

+  18 

and  S  (x  ,  •  •  • ,  x,n)  =  —  ;  the  probability  of  each  value  being  at 
15  6  20  1 1 
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Table  29. 


Peak  Discharge  y,  in  Cubic  Feet/Second  of  the 
Missouri  River  at  the  Omaha  Station. 


Y  ear 

y. 

i 

Year 

yi 

X. 

1 

1930 

62,400 

1945 

106,000 

1 

1931 

45, 000 

1946 

81 , 900 

-1 

1932 

104,000 

1947 

147,000 

1 

1933 

89,700 

1948 

97,100 

1 

1934 

32,900 

1949 

169,000 

1 

1935 

85,600 

1950 

194,000 

1 

1936 

78,500 

1951 

86,200 

1 

1937 

110,000 

1952 

536,000 

1 

1938 

114,000 

1953 

111,000 

1 

1939 

140,000 

1954 

80,300 

-1 

1940 

34,300 

1955 

34,700 

-1 

1956 

36,900 

-1 

1957 

38,000 

-1 

1958 

31,600 

-1 

1959 

37,100 

-1 

1960 

117,000 

1 

1961 

30, 300 

-1 

1962 

103,000 

1 

1963 

32,700 

-1 

1964 

35,700 

-1 
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least  so  large  is  .  004,  .067,  .004,  .041,  .022  and.  019,  respectively. 

For  each  set  of  observations  R*  is  the  point  corresponding  to  the 

1 5 

year  1953;  rejecting  the  null  hypothesis  results  in  estimating  a 
decrease  in  greatest  annual  flow  beginning  in  1954. 


Similarly  for  each  set  of  10  consecutive  years  one  calculates 
for  S1Q  the  values  —  ,  -,  2,  2,2,  2,  —  ,  j  .  —  and  —  ;  the 
probability  of  each  value  being  at  least  so  large  is  .  024,  .  038,  .  008, 


.008,  .008,  .  008,  .049,  .038,  .  119,  .797  and. 797  ,  respectively. 
For  the  first  nine  values  R^"  1®  the  point  corresponding  to  the  year 

1953;  for  the  last  two  values  Ri0  (x10>  *  *  * »  x19>  =  R|0  (xn»  ’  •  * .  X2Q) 
=  0  . 


Also  for  each  set  of  5  consecutive  years  between  1950  and  1957, 

S*  =  2  and  P(S^  =  2  |  p  =  ^  )  =  .  125.  For  these  sets  of  observations 
R*  corresponds  to  the  year  1953;  rejecting  the  null  hypothesis  for  size 


of  test  o(  =  ,  125  results  in  estimating  a  decrease  in  peak  discharge 

beginning  in  1954.  For  each  other  set  of  5  consecutive  years  S*  <  2  . 

5 

Next  we  investigate  whether  Page's  test  statistic  also  detects  a 


change  in  peak  discharge. 
Page's  test  statistic  is  M 


For  the  one-sided  test  with  p  =  —  >  q 

2 

=  min  S  -  max  S.  where  S  =  S  X.  , 

L  r  i  J  r  i 

0<r<n  0<  i<  r  i=l 


r  =  1,  •  •  • ,  n,  and  SQ  =  0  .  For  n  =  20,  15,  10  and  o<  =  .  05,  the  critical 
regions  contain  those  sequences  for  which  M<  -10,  -9,  -7,  respec¬ 
tively.  Critical  regions  for  these  values  of  n  and  were  found  by 
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using  tables  constructed  by  Page  (1955),  performing  calculations 
for  smaller  n  than  listed  in  the  tables  and  interpolating.  For  the 
transformed  data  x  M=  -7;  for  each  set  of  15  consecutive 

years  between  1945  and  1964,  M  >  -7;  and  for  each  set  of  10  consecu¬ 
tive  years,  M  > -6.  Hence  in  none  of  these  instances  does  Page's  test 
detect  a  change  for  size  of  test  =  .  05.  For  n  =  5  and  tx  =  .  05,  the 
critical  region  contains  the  sequence  -1-1 -1-1-1.  For  the  two  sets 
of  years  1954-1958  and  1955-1959,  Page's  test  detects  a  change  in  peak 
discharge  for  this  size  of  test  and  the  change  is  estimated  to  have 
occurred  before  sampling  began. 


CHAPTER  3 


ALTERNATIVE  HYPOTHESIS  DISTRIBUTIONS 


The  alternative  hypothesis  distributions  of  the  test  statistics 

and  the  change  point  random  variables  are  found  from  the  joint  distri 

buttons  (S+,  R+  I  p,  p,  q)  and  (S  ,  R  I  p,  p,  q).  Obtaining  expres 
n  n  l  n  n  i 

sions  for  the  joint  distributions  is  an  incredibly  tedious  task;  results 
facilitating  this  task  are  presented  in  Section  1  and  application  of 
these  results  is  illustrated  in  Section  2.  Properties  of  the  point 
distributions  are  derived  in  the  next  two  sections;  from  these  prop¬ 
erties  the  behavior  of  the  marginal  distributions  of  the  test  statistics 
and  estimators  of  change  points  are  deduced. 


3.  1  Notation  and  Tools 


Let  c^s.j  |p,a)  denote  the  number  of  sequences  xi*’'*'xn 

with  k  1’  s  and  n-k  -1*  s  that  have  S  (x, ,  •  •  • ,  x  )  =  s,  R  (x, ,  •  •  • , 

n  l  n  n  i 

x^)  =  j  and  a  l's  in  the  first  p  places.  Thus 
3.1.1  P  {(Sn,  Rn)  =  (s,  j)  |  p,  p,  q  } 


n  p 

=  E  S  c  (s,j 
k=0  a=0 


*  a/,  \P’a  k_a  ,,  .n-p -(k-a) 

p,  a)  p  (l-p)K  q  (1-q)  H 


n 

Define  c.  (s  I  p,  a)  =  E  c.  (s,j  Ip,  a)  so  that 
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r 

3.1.2  =  6  [  p,  p,  q)  =  2  2  c^s  I  p,a)pa(l-p)P  a 

k=0  a=0 

•  qk-a(i.q,n-'>-<k-41 


and  C  (s  p,  a)  =  2  c  (s  p,  a)  so  that 

k  *P  '  s>s  k  ' 

-  or,  p 


3.1.3  P(Sn>s  |p,p,q)=  |  %  C  (s  p,a)pa(l-p)p-a 


<*,  P 


k=0  a=0 


k\  ° 

a,  p 


k-a,,  n-p  -(k-a) 

•  q  (1-q)  K 


Define 


dk(j  I  ?•  a,  8  )  =  2  C  (s,  j  P,  a)  so  that 


*'p'  s>s  k 
-  at,  p 


3.1.4  P(Pn,<*  =  •>  P»P»*> 


n  p 


2  2  \(j  |P.a,s ■  )pa(l-P)p”aqk‘a(l-q)n’p'(k-a) 

k=0  a=0  *'p 


P(Sn^8o<,p  lP'P'q) 


Similarly  define  c^(s+,j  |p,a),  c^(s+  |  p,  a)  ,  C^(s^  |  p,  a)  and 

VJ  |  P.  a,  s^  p)  . 

The  ck(s  ,  j  |  p,  a)1  s  and  0^(8,  j  |p,  a)’s  must  be  counted  to 
find  the  joint  distributions  under  the  alternative  hypothesis.  Results 
useful  for  minimizing  the  effort  needed  to  count  these  quantities  are 


now  presented.  The  one-sided  testing  situation  is  considered  first 
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Theorem  1.  For  each  n,  k,  s+,j,  p  and  a,  c^(s+,  j  |  p,  a) 

=  c*  v(s+,  n-j  n-p,  n-p-a(k-l));  hence  P{(S+,  R+)=(e+,  j)|  p,  p,  q} 
h-k  1  n  n  1 

=  P{(S^,  R^)  =  (s+,  n-j)  |  1 -q,  n-p,  1 -p  }  for  all  p  and  q. 

Proof.  If  S+(x,  ...,x  )  occurs  at  j  and  if  x,,.*.,x  has  k  l's 
'  ■  n  i  n  In 

with  a  l's  in  the  first  p  places,  then  S+( -x  ,  •••,-x.)  occurs  at 

n  n  1 

n-j  and  ~xn>  “ '  >  ~x}  has  n-k  1,8  with  n-p-(k-a)  l's  in  the  first 
n-p  places.  | 


Hence  the 


c^(8+.  j  |  p»  a)' s 


need  only  be  counted  for 


k~[f]  or  for  P<[f]  or  for  j-  [f]  * 
need  only  be  carried  out  for  j<  or  for 
joint  probabilities. 


Numerical  computation 
pc  (0,^-1  to  evaluate  the 


Corollary  2.  (i)  For  each  n,  k,  s+,p  and  a,  c^(s+|p,a) 

=  cn-k*8+  i  n"P»  n-p-(k-a))  and  C^(s+  |  p,  a)  =  C*_k(s  +  |  n-p,  n-p  -(k -a)). 

Thus  P(S*  >  s+  |  p,  p,  q)  =  P(S*  >  s+  |  1  -q,  n-p,  1  -p)  for  all  p  and  q  . 

4*  *f  I  + 

(ii)  For  each  n,  k,  j,  p,  a  and  s  ,  d,  (j  p,  a,  s_,  ) 

<X,  P  K  '  «,  p 

*  d._k<n-i  |  n-p.  n-p -(k-a),  •  *  )  .  Thu»  P(p^  =  j  |  p,  p.  q> 


■  P(C  *  "-j 


1-p,  n-p,  1-q)  for  all  p  and  q  . 
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p  +  + 

As  a  check  when  counting  note  that  2  c^(s  '  J  P»  a)  = 


a=0 


+.  + 


+,  + 


+,  + 


c  (a  ,  j),  2  c  (s  p,  a)  =  C  (s  )  and  2  d.(j  p,  a,  e  ) 

05  9  P 


a=0 


a=0 


d.  (j  I  s  )  for  each  k,  s+,  j,  p,  a  and  s  + 
k  I  p  J  K  <x,  p 


Similar  results  hold  for  the  two-sided  testing  situation. 


Theorem  3.  For  each  n,  k,  s,  j,  p  and  a,  c^(s»  j  |  p,  a)  *  ck(s,  n-j  |  n_P» 
k-a)  =  cn_k(8,  j  |  p,  p  -a)  =  cn_k(s,  n-j  |  n-p,  n-p -(k-a));  hence  P{(Sn,Rn) 

=  (».  i)  j  p.  p.  q>  =  p(<sn>  Rn)  -  (*>  n'J)  {  q*  n-p,  p}  =  P{(sn,  Rn)  = 

(s,  j)  |  1 -p,  p,  1 -q}  =  P{(Sn<  Rn)  =  (s,  n-j)  |  1 -q.  n-p,  1 -p}  for  all  p 
and  q  , 


Proof.  If  Sn(x^,  •  •  • ,  xn)  occurs  at  j  and  if  x^,  •  •  • ,  x^  has  k  l's  with 

a  l's  in  the  first  p  places,  then  ®n^xn»  *  *  ’ »  xj )  occurs  at  n-j  and 

x  ,  •••,Xj  has  k  l's  with  k-a  l's  in  the  first  n-p  places,  and 

* 

S  (-x  ,  •  •  • ,  -x  )  occurs  at  j  and  -x,,*»*,-x  has  n-k  -l's  with 
n  i  n  In 

p-a  l's  in  the  first  p  places.  H 


Hence  the  c^(s,  j  |  p,a)'s  need  only  be  counted  for  k<_|^J  and 
j<_^J  or  for  k<  and  p<  .  Numerical  computation  need  only 
be  carried  out  for  j<  and  pc(0,  i-J  . 


Corollary  4.  (i)  For  each  n,  k,  s,  p  and  a,  c^(s  |  p,  a)  =  c^(s 

=  cn_k(s  |  p»p-a)  =  cn.k(8  |  n'P»  n-p-fk-a))  a«d  C  (s  |  p,  a)  = 
-  Ck(.  |  n-p,  k-a)  =  Cn.k(®  |  p.  p-a)  =  cn_k(*  |  n-p,  n-p -(k-a))  , 


n-p,  k-a) 


Thus 
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P(Sn  >  S  |  p 

>p»q)  = 

P(Sn  >  s  |  q,  n-p, 

P)  = 

P(S  >  8  1  1  - 
n  ~  1 

p»  p»  i  -q) 

=  P(S  >  s 
n  — 

1  1  -q.  n-p 

»,  1  -P)  • 

(ii)  : 

For  each 

n,  k,  j,  p,a  and 

ScX,  p' 

dk(j  |  P»  a-  « 

p>  ■ 

dk(n-j  |  n-p 

,  k-a,  s 

p>  =  dn-k(j  |P'P- 

a,  8 

p> ’  dn-k(n-j 

|  n-p,  n-p- 

(k-a),  s  ) 

oi,  p 

.  Thus 

P(pn,<x  =  J  |p'p‘ 

.  q)  = 

P®„.«  *  "'J  1 

q,  n-p,  p)  = 

P(^.«  *  J 1 

1  -p»  p,  1 

-q)  "  P(pn,*  =  n 

-j  |i 

-q,  n-p,  1  -q)  . 

Again,  as  a  check  when  counting  note  that  £  c^fs,  j  J  p,  a)  = 

a=0 

P  |  P  , 

c,(a,j),  2  c  (s  p,  a)  =  c  (s)  and  £  d  (j  I  p,  a,  s _ )  = 


a=0  a=0 

d  (j  Is  )  for  each  k,  s,  j,  p,  a  and  s 


<*»  P 


<*,  P 


<*»  p 


Properties  of  the  test  statistics  and  change  point  random  vari¬ 
ables  under  the  alternative  hypothesis  are  of  interest  when  the  null 
hypothesis  is  rejected,  so  expressions  for  the  joint  distributions  need 
only  be  obtained  for  s+  and  s  in  the  critical  regions  of  the  tests. 
Since  P(S+  >  1  I  p)  and  P(S  >  1  I  p)  are  very  large  for  each  n  and 
p,  the  critical  regions  contain  only  values  s+  >  1  and  s  >  1  . 


3.  2  The  Method 

The  method  used  to  find  expressions  for  the  joint  distributions 
uses  the  information  obtained  in  III.  of  the  procedure  for  counting  in 
Section  2.2.  Available  from  III.  are  the  c^(s+,  j)'s,  c^(s,  j)'s  and 
the  types  of  sequences  yielding  each  value  of  the  test  statistic.  The 
manner  in  which  the  c^(s+,  j  |  p,a)'s  and  c^(s,  j  |  p,a)'s  are  found 
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from  this  information  is  straightforward:  for  each  p,  1  <  p  <  n-1, 
count  the  number  of  sequences  which  have  a  l's  in  the  first  p  places, 
a  <  p  .  The  only  means  for  reducing  the  counting  task  are  the  results 
in  the  previous  section. 

Application  of  the  results  in  the  previous  section  are  illustrated 
by  finding  expressions  for  the  joint  distributions  for  n  =  5  . 

Example  1.  The  c*(s+,j  I  p,a)'s  are  found  for  n  =  5  and  s  +  =  2,  — ,  ^  , 
K  '  2  3 

Expressions  for  the  joint  distributions  are  listed  below. 

P((S5  *  R5  *  =  (2»  I  P*P»  V  ^ 


p  =  1 

p(i-q)4 

p  =  2 

p(l-p)(l-q)3 

p  =  3 

p(l-p)2(l-q)2 

p  =  4 

p(l-p)3(l-q) 

p  =  1 

pq(l-q)3 

2,  3 

p  =  2 

p  (i-q) 

p  =  3 

p2(l-p)(l-q)2 

p  =  4 

p2(l-p)2(l-q) 

p  =  1 

2/,  \  2 
pq  (i -q) 

2  ,  2 

C4 

II 

a 

p  qd-q) 

p  =  3 

p  (i-q) 

P  =  4 

p3(l-p)(l-q) 
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j  =  4 

P  =  1 

pq3(l-q) 

2  2, 

(M 

II 

Q. 

p  q  (i-q) 

P  =  3 

p3q(i  -q) 

p  =  4 

p4(i  -q) 

p{(S+  Rg)  =  (|,j)  |  p,p,q) 


j  =  i 

p  =  1 

3pq(l -q)3 

P  =  2 

3p(l  -p)q(l  -q)2 

P  *  3 

p2(l-p)(l-q)2  +  2p(l  “P)2q(l  -q) 

P  =  4 

2pZ(l  -p)2(l-q)  +  p(l-p)3q 

j  =  4 

p  =  1 

2pq2(l-q)2  +  (l-p)q3(l-q) 

p  =  2 

p2q(l-q)2  +  2p(l -p)q2(l-q) 

P  =  3 

3p2(l  -p)q{l-q) 

P  =  4 

3p3(l-p)(l-q) 

P{Sj, 

R5>=(? 

*  J>  |p-p*q) 

2,  >2 

j  =»  2 

P  =  1 

pq  (i-q) 

2  ,  ? 

p  =  2 

p  qd-q) 

P  =  3 

p2(i  -p)q(i  -q) 

2,  ,2 

P  =  4 

p  (1-p)  q 

j  =  3 

p  =  1 

(1  -p  )q2(  1  — q  )2 

p  =  2 

p(l-p)q(l-q)2 

L 


!l 
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p  =  3  p2(l-p)(l-q)2 

p=4  p2(l-p)2(l-q) 

When  (s+,  j)  =  (|-,4)  ,  for  instance,  the  types  of  sequences  involved 

are  T  (4,  3)  \  T  (3,  3)  .  There  are  three  such  sequences:  -1111-1, 

5  5 

1-111-1,  11-11-1  .  For  p  =  1,  two  sequences  have  a  1  in  the  first 

+  3  I 

place  and  one  sequence  has  a  -1  in  the  first  place;  I  ^  = 

c+(I,4  j  1,0)=  1  .  For  p  =  2,  one  sequence  has  2  1' s  in  the  first 

two  places  and  two  sequences  have  a  1  and  a  -  1  in  the  first  two 

places;  c*  ( j’ ,  4  |  2,  2)  =  1  ,  c^  ( |- ,  4  |  2,  1 )  =  2.  Forp  =  3,all 

three  sequences  have  2  l's  and  1  -1  in  the  first  three  places; 

c+  ( — ,  4  I  3,  2)  -  3  .  For  p  =  4  ,  all  three  sequences  have  3  l's  and 
3  2  * 

+  3  t 

1  -1  in  the  first  four  places;  ,  4  |  4,  3)  =  3  .  Note  that  expres- 

4“  3  i 

sions  for  P{(S  ,  R  )  =  (7,  1)  p,  p»q)  can  be  obtained  from  those 

3  y  fa 

for  P{(S*,  R^  )  =  (  —  ,  4)  |  p,  p,q}  according  to  Theorem  1 . 

1  3  4 

Example  2.  The  c^s.j)  |  p,a)'s  are  found  for  n  =  5  and  8  =  2»  J  •  J  • 

Expressions  for  the  joint  distributions  are  listed  below. 


p{(S5, r5)  =  (2,  j)  j  p, p, q) 


j  =  1 


p  =  1  p(l  -q)4  +  (1  -p)q4 
p  =  2  p<l-p)(l  -q)3  +  p(l-p)q3 
p  =  3  p(l -p)2(l -q)2  +  p2(l -p)q2 
p  =  4  p(l  -p)3(l  -q)  +  p3(l-p)q 
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j  =  2 

-Q 

(1 

pq(l  -q)3  +  (l-p)q3(l  -q) 

CM 

1) 

CL 

2  3  2  3 

P  (1-q)  +  (l-p)q 

"D 

II 

OJ 

p2(l-p)(l-q)2  +  p(l-p)2q2 

II 

Q. 

p2(l-p)2{l-q)  +  p2(l-p)2q 

j  =  3 

p  =  1 

pq2(l-q)2  +  (1  -p)q2(l-q)2 

p  =  2 

p2q(  i -q)2  +  (i  -p)2q2(i -q) 

p  =  3 

P3(l-q)2  +  (l-p)3q2 

II 

Q. 

p3(l-p){l-q)  +  p(l-p)2q 

j  =  4 

p  =  l 

pq3(l-q)  +  (l-p)q(l-q)3 

P  =  2 

p2q2(l-q)  +  <l-p)2q(l-q)2 

p  =  3 

p3q(l-q)  +  (l-p)3q(l-q) 

II 

Q. 

p4(l-q)  +  (l-p)4q 

p«s5. 

R5)  =  ( 

j)  j  p.  p.  q  ) 

j  =  1 

p  =  1 

|pq(l-q)3  +|(l-p)q3(l-q) 

P  =  2 

|-p(l -p)q(l -q)2  +  |-p(l  -p)q2(l  -q) 

p  =  3 

p2{l  -p)0  -q)2  +  J  p(l  -p)2q(l-q)  +  p(l  -p)2q2  + 

|-p2(i  -p)qd  -q) 

p  =  4  2pz(l  -p)2(l-q)  +  jp(l-p)3q  +  2pz(l  -p)2q  + 

jp3(i-p)(i-q) 
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j=4  p=l  2pq2(l -q)2  +  i- (1 -p)q3(l -q)  +  2(1 -p)q2(l -q)2  + 

jpq(l-q)3 

p  =  2  p2q(l-q)2  +  |-p(l -p)q2(l-q)  +  (1  -p)2q2(l  -q)  + 

|-p(l-p)q(l-q)2 
P  =  3  |-p2(l -p)q(l-q)  +  |p(l  -p)2q(l-q) 

p=4  |-p3(l-p)(l-q)  +  |-p(l-p)3q 

P^S5*  R5)  =  *  j*  |  P*P»  9) 

j=2  p=l  pq2(l -q)2  +  (1 -p)q2(l -q)2 

p=2  P2q(l -q)2  +  (1 -p)2q2(l -q) 

p  =  3  p2(l  -p)q(l  -q)  +  p(l  -p)2q(l  -q) 
p=4  P2(l -p)2q  +  p2(l -p)2(l -q) 

j  =  3  p  =  l  (1  -p)q2(l  -q)2  +pq2(l-q)2 

p  =  2  P(1 -p)q(l -q)2  +  p(l -p)q2(l -q) 

p  =  3  p2(l  -p)(l  -q)2  +  p(l  -p)2q2 

P=4  p2(l-p)2(l-q)  +  p2(l-p)2q 

Consider  the  pair  (s,j)  =  (|-,  4);  the  cR(  | ,  4|p,a)'s  need  only 
be  counted  for  •  The  three  sequences  involved  are  -1111-1, 

1-111-1,  11-11-1;  the  first  sequence  has  J  =1,4  so  it  contributes 
\  to  the  count.  Note  that  c3(|,4  |  1.  1)  =  2  ,  c^i.4  |  1,  0)  =  , 
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(1,4  2,  2)  =  1,  c(f-,4  2,1)  =  -,  c  (|-,4  3,2)=-  and 


3' 4  ''I-’"'  -  -  3'  2  '  *  I -  2  '  ’3' 2  ’  *  '  2 

c3(|,4  |  4,3)  =  |.  The  | ,  4  |  p,  a)*  a  ,  | ,  1  |  p,  a)‘s  and 

c  ( 2.  f  i  I  p>  a)1  s  can  be  found  from  the  c  ( |- ,  4  I  p,a)'s  according 
3  2  I  * 


to  Theorem  3. 


The  marginal  distributions  S*  ,  P5  ^  '  ^5  an<^  P5  <x  ma^  136 
obtained  from  the  joint  distributions. 


3,  3  The  One-Sided  Testing  Situation 

The  behavior  of  the  joint  densities  (S^,  Rn)  is  analyzed;  the 
resulting  behavior  of  the  power  function  and  estimator  for  the  change 
point  are  discussed.  Lastly,  properties  of  q  are  investigated. 


3.  3,  1  The  Joint  Densities 

The  following  result  describes  the  local  behavior  of  the  joint 
density. 

Theorem  5.  For  each  n,  s+,  j,  p  and  q  where  p  >  q, 

P{(S+,R+)  =  (s+,  j)  Ip,  p=j,  q}  >P{(S"\  RV(s+,  j)  |p,  p=j  +  l,q}. 
n  n  1  u  u 


T  “r  t 

Proof.  Suppose  x^,  •  •  • ,  has  S^  =  s  and  R^  =  j  ;  then 

p(x  , •  •  • ,  x  Ip, p=j»q)  =  aqk  a(i-q)n  j  (k  a)  if  x,»,**'x- 


1 


r 


l  .  ,  a-l._  ,j-a  k+1  -a 

has  k  l’s.  Also  P(x  ,  •  •  • ,  xr  |  p,  p=j-l,  q)  =  p  (1-p)  q 

•  (1  -q)n "j'^k‘a^  and  P(xjf  •  •  • ,  x^  | p,  p=j+l,  q)  =  P  (1-p)^ 

V 

•  q 


^*a(l -q)n’^ since  x^  »  1  and  x. 


j  +  1 


=  -1  . 
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Unfortunately,  Theorem  5  cannot  be  generalized  to  state  that 
P{(S+,  R+)  =  (s+,j)  |p,  p  =  j+i,  q  }  P{(S+R*)  =  (s+,j)|p,p=j+(i+l),q}; 
note  that  P{  (S*.  R*)  =  (j  ,  1 )  |  p,  p=2,  q  }  =  3p(l  -p)q(l  -q)2>p2(l  -p)(l  -q)Z 
+  2p(l-p)2q(l-q)  =  P{(S*,R*)  =  (|,  l)|p,p  =  3,  q}  if  and  only  if  q>j. 

The  following  properties  describe  general  behaviors  of  the  joint 
density;  computation  indicates  that  each  is  true  but  derivation  for 
arbitrary  n  has  not  been  possible. 


Property  1.  For  each  n,  s+,  i  >  0,  p  and  q  where  p  >  q  , 

P{(S+,  R+)  =  (s\  j)  I  p,  psj+1,  q  }  >  P{(S*  R+)  =  (s+,  j)  I  p,  p=j+i+l,  q} 
n  n  i  n  n  1 

when 

(i)  j<  [|]  and  ^  <  q  <  P 
and  (ii)  j  >  ; 

P  {(S+.R+)  =  (s+,  j)  |  p,  p=j-i,  q>  >  P{(S*  R*)  =  (s+,  j)  |  p,  p=j-(i+l),  q} 
when 


and  (ii)  j  >  and  q  <  p  <  j  . 


For  j<  j^J  the  first  inequality  in  Property  1  may  be  reversed 
for  some  i  when  p  >  >  q  and  q  is  small  or  when  —  >p  >  q  and 

q  is  small;  for  j>  the  second  inequality  may  be  reversed  for 

some  i  if  p  is  large.  For  j  =  the  inequalities  may  be  reversed 


for  some  i  wher.  p  is  large  and  q  is  small. 
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Summing  the  probability  inequalities  over  values  s+ >  s+  >  1 

«,  p 

yields  global  properties. 


Property  2.  For  each  n,  s  >  1  ,  j,  i  >  0,  p  and  q  where  p  >  q  , 

o<f  p 


and 


2  P{(S*  R+)  =  (s+,j)|p,p=j+i,q} 

P«S+  R*)  =  (s+,  j)  |p,  p=j+i+l,  q} 


+  + 

s  >s 

<X,  p 


>  2 

+.  + 

s  £s 


«,p 

+  o+\  _  /.+  _•» 


2  P{(S^.R^)  =  (s+,j)|p.p=j-i,q} 

>  2  P{(S+  R+)  =  (s+,j)|p,p=j-(i+l),q}. 


S+£S  + 

<X,  p 


+.  + 
a  >s 
-  or,  p 


+  +  + 

That  la,  the  probability  that  S  >  s  and  R  =  j  is  largest 

n  —  ex,  p  n 

when  p  =  3  and  the  probability  decreases  monotonically  as  p 
increases  or  decreases  from  j  . 


Property  3.  For  each  n,  s_^  >  1,  j,  p  and  q  where  p  >  q  , 


S  P((S^,R^)=  {s+,j)|p,p=j+i,q} 

>  2  P{<S+n,Rn>  *  <B+,3)|p>P=j-i>q} 


+.  + 
s  >  s 
“  <x,  p 


s+>s  + 

—  <*,  p 
1 


when  i  >  0  and  p  <  - ,  or  i  >  0  and  q  £  1  -p  <  p,  or  i  <  0  and  1  -p  <  q  <  p. 
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Property  4.  Although  no  monotonic  properties  hold  for  the  joint 

probabilities  when  R+  =  j,  and  R+  =  j,  the  following  general  tendency 
r  nine 

is  observed:  2  P{(S  ,  R*)  =  (s  ,  j, )  |  p,  p=j, »  q  ) 

n  n  I*  X 


+  + 
s  >  s 
“  <*,  P 


>  2  P((S+  R*)  =  (8+,j»)  p.p  =  j,.q  } 


+  + 

S  >  8 

-  <*,  p 


n  n 


when 


jl  <  h  ’  P  -  z  0r  jl<j2'  q<1~p<p  or  j2<  jj,  1-p  <  q  <  p 


For  fixed  p  and  varying  q  the  joint  probabilities  behave  accord¬ 
ing  to  Property  5. 


Property  5. 

P>qi  >q2  . 


For  each  n,  s  >1,  j,  i  >  0,  p.  q  and  q  where 
P  X  £. 


S  PUS*,  R*)  =  (8+,  j)  |p,  P=j+i,  q .} 

n  n  i 


+ 


+  + 

8  >  S 

-  o(,p 


<  2  P{(S^,  R^)  =  (s+,  j)  jp,  p=j+i,  q2  }  . 

+  + 

8  >  S 

-  CX,P 


Thus  the  probability  that  S+  >  s  and  R+  =  j  increases  for 
r  ’  n  —  ck,  p  nJ 

fixed  p  >  j  and  p  as  q  decreases.  For  i  <  0  the  inequality  in 
Property  5  may  be  reversed;  in  general  the  inequality  is  true  for  a 
larger  set  of  i  <  0  and  q^  when  j  is  small  and/or  p  is  large. 


j! 
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3.  3.  2  The  Power  Function 

The  power  function  P(S+^>  s"*"  I  p,  p»  q)  is  analyzed  as  a  function 

n  p  * 

of  p  and  q  .  Power  functions  ior  n  =  10,  20,  =  .  06  and  p  =  .  3,  .5, 

<*  =  ,  10  and  p  =  .  7,  .9,  and  various  values  of  p  are  found  in  Figures 
30.  1-30.  4;  the  figures  illustrate  the  following  typical  behavior  of  the 
power  functions  for  n  <  20  .  For  p  <  the  power  increases  as  q 
decreases  from  p  for  each  p  and  the  power  tends  to  increase  as  p 
decreases  for  each  q  ;  see  Properties  4,  5.  For  small  n  one  can 
show  algebraically  that  the  power  increases  as  q  decreases  for  each 
p  ,  but  the  manipulations  required  to  verify  this  for  larger  n  are 
horrendous.  Also  for  p  <  the  power  tends  to  increase  as  n 
increases  for  each  q  and  p  =  [cn],  c  <  1  . 

For  P  >  ^  the  test  is  slightly  biased;  the  bias  is  occurring  for 
q  near  i-  and  p  small,  as  is  explained  by  Properties  3,  4  .  As  o< 
increases  the  bias  disappears.  A  further  explanation  for  the  bias  is 
provided  by  the  properties  (Al)  and  (A2)  of  P(S*  >  s+  jp)  in 
Section  2.  3;  if  the  tendency  for  the  maximums  to  occur  at  p  =  i- 
for  smaller  s+  values  as  n  increases  is  true,  one  expects  that  the 
power  function  will  remain  biased  for  q  near  and  p  small  as 
n  increases.  The  power  is  not  necessarily  an  increasing  function  in 
a  for  p  >  —  unless  p  and  q  are  sufficiently  small. 

The  power  of  S*  ,  the  generalized  likelihood  ratio  statistic 
and  Page's  test  statistic  M  are  tested  in  Table  31  for  n  =  5  and  p  =  .  4, 


Figure  30.  2.  P(S  >  s  ,  -  j  p=.  5,  p,  q) 
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Table  31. 

The  power  of  S+  ,  C 

n  p,  q 

and  M 

for  a  sample  of 

size  n=5. 

P~»  4 

K 

4 

.  193 

.  172 

.  151 

.  130 

C.  4,  q 

4 

.  194 

.  173 

.  151 

.  130 

M 

4 

.  162 

.  151 

.  140 

.  130 

< 

3 

.  256 

.  209 

.  167 

.  130 

C.  4,  q 

3 

.  292 

.  230 

.  176 

.  130 

M 

3 

.  243 

.  202 

.164 

.  130 

•; 

2 

.  308 

.  234 

.  175 

.  130 

C.  4,  q 

2 

.437 

.  307 

.  206 

.  130 

M 

2 

.  365 

.  269 

.  191 

.  130 

■; 

1 

.  298 

.  222 

.  167 

.  130 

C.4., 

1 

.  656 

.410 

.  240 

.130 

M 

1 

.  547 

.  358 

.  223 

.  130 

153 


.5,  .7,  .9.  To  attain  approximately  the  same  significance  level 

for  each  p  it  was  necessary  to  randomize;  the  procedure  used  was 

to  assign  the  required  fraction  of  probability  to  each  sequence  for 

which  the  value  of  the  statistic  overlapped  the  critical  region. 

The  powers  of  the  tests  were  not  computed  for  p  <  .4  since 

for  the  generalized  likelihood  ratio  test  and  Page's  test  the  null 

hypothesis  cannot  be  rejected  at  a  reasonable  c<  level  for  small  p. 

For  the  test  based  on  S*  one  can  reject  H  for  the  smaller  values 

5  o 

of  p  ,  however,  for  reasonable  levels  since  P(S^  >  s  j  p)  = 

P(S*  >  s  I  1  -p)  for  all  p  . 

5  1 

Table  31  shows  that  S*  has  greater  power  than  M  when  the 
change  occurs  later  in  the  sequence;  M  has  greater  power  otherwise. 
Hence  it  seems  that  the  test  based  bn  S+  is  better  suited  for  detecting 
a  change  later  in  the  sequence  than  is  M  .  Recall  the  difficulty  with 
Page's  statistic  in  estimating  change  points  later  in  the  sequence;  also 
recall  that  the  types  of  sequences  in  the  critical  region  of  his  test  are 
not  in  general  indicative  of  sequences  for  which  a  change  in  the  prob¬ 
ability  of  a  1  has  decreased;  x,,  •  •  •  .  x  and  x  ,  •  •  • ,  x,  give  the 

I  n  n  1 

same  value  of  the  statistic.  For  p  >  ^  it  appears  as  though  M  is 
constructed  to  behave  more  like  a  likelihood  ratio  test  than  a  change 
point  test. 

The  power  of  the  generalized  likelihood  ratio  test  is  larger  than 
that  of  Page's  test  unless  p  >  ^  and  p  is  small. 
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The  power  of  the  test  based  on  S  is  larger  than  the  power 

5 

of  the  test  based  on  C  when  p  is  large;  otherwise  the  opposite 

p»  q 

is  true.  Hence  here  also  it  seems  that  the  test  based  on  S  is  better 

n 

suited  for  detecting  a  change  later  in  the  sequence  than  is  C 

P*  9 

Moreover  C  suffers  from  the  same  drawback  as  M  in  not  being 

p,  q 

able  to  estimate  change  points  later  in  the  sequence;  and  M 

both  worsen  in  this  respect  as  p  decreases.  It  is  incredibly  more 

difficult  to  evaluate  C  for  a  sample  sequence  and  to  obtain  critical 

p,  q 

regions  and  properties  of  the  test  than  with  either  or  M  ;  also 

note  that  although  q  is  in  general  unknown  the  critical  region  of  the 

test  based  on  C  depends  on  q  as  well  as  an  p  . 

P»  q 

Table  32  lists  the  power  of  Page's  statistic  for  n  =  20,  P  =  ^ 
and  <x=  ,05  as  computed  by  Chernoff  and  Zachs  (1964)  and  the  power 
of  S*  for  p  =  <*  =  .  057.  Here  also  S+  has  greater  power  for 

faU  U  tv 

larger  values  of  p  and  M  has  greater  power  otherwise. 

3.  3.  3  The  Estimator  of  the  Change  Point 

Property  2  indicates  that  the  mode  of  the  probability  that 

S+  ^  s*  and  R+  =  j  occurs  at  p  =  j  and  that  the  probability 
n  «,  p  n 

decreases  monotonically  as  p  increases  or  decreases  from  j; 
Property  3  further  describes  the  manner  in  which  the  probabilities 


decrease  as  p  increases  or  decreases  from  j.  These  properties 
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Table  32.  The  power  of  S*  and  Page's  test  statistic  M  for  a  sample 
of  size  n  =  20,  o<  =  .  05. 


S20 

18 

.  098 

.  084 

.  071 

.  057 

M 

18 

.  066 

.  060 

.  055 

.  050 

S+ 

20 

16 

.  125 

.  097 

.  075 

.  057 

M 

16 

.  105 

.  082 

.  064 

.  050 

S20 

14 

.  177 

.  120 

.  079 

.  057 

M 

14 

.  171 

.116 

.  076 

.  050 

S20 

12 

.  207 

.  132 

.  084 

.  057 

M 

12 

.  272 

.  162 

.  092 

.  050 

S20 

10 

.  259 

.  156 

.  090 

.  057 

M 

10 

.407 

.  223 

.111 

.  050 

o 
4-  <M 
CO 

8 

.  316 

.  185 

.  097 

.  057 

M 

8 

.  545 

.  292 

.  132 

.  050 

S20 

6 

.  365 

.  215 

.  104 

.  057 

M 

6 

.  664 

.  364 

.  154 

.  050 

S20 

4 

.  410 

.  249 

.113 

.  057 

M 

4 

.  756 

.  431 

.  176 

.  050 

S20 

2 

.  437 

.  276 

.  120 

.  057 

M 

2 

.  821 

.488 

.  195 

.  050 
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and  the  behavior  of  the  power  function  are  useful  for  analyzing 


P(S  =  j  p,  p,  q)  as  a  function  of  its  parameters, 
n,  x  1 

For  each  n  <  20  computation  indicates  that  P(p^  =j  I  p,  p,  q) 

also  has  a  mode  at  p  =  j;  moreover  as  p  increases  or  decreases 

from  j  ,  the  probability  generally  monotoni cally  decreases.  The 

manner  in  which  P(p  +  =  j  I  p,  p  =  j+l,q)  decreases  for  i  >  0  or 

n,  I 


n,o< 


i<0  is  as  follows:  P(p  =j  p,  p  =  j+i,  q)  >  P(p  =.j  p,  p=j-i,  q) 


n,<K 


when  p<_  — ,  i>0  or  q  <  1 -p  <  p,  i>0  or  l-p<q<p,  i<0. 

As  a  function  in  q  ,  P(p  +  =  j  I  p,  p  -  j+i,  q)  in  general 

n,  x  I 

decreases  as  q — >-p  for  i  >  0  ,  increases  as  q — >.p  jC  ~  ,  for  i  <  0  , 
and  may  increase  and  then  decrease  as  q — ►p  >  j  for  i  <  0  . 


The  magnitude  of  the  modal  probability  P(p  =  j  j  p,  p=j,  q) 

n»c* 

relative  to  2  P(p  +  =j  !  p,  p  =  j+  i,  q)  depends  heavily  on  j,  p  and 

i  n*  «<  ~ 

q.  For  each  j  ,  as  p-q  increases  the  relative  magnitude  of  the 
model  probability  increases;  for  p  <_  the  relative  magnitude  for 
j>^J  tends  to  be  greater  than  that  for  j<£^-J  for  each  I  for  p  >  ^ 
the  relative  magnitude  for  j<^J  tends  to  be  larger  than  for  j>^j 


when  1  -p  <  q  <  p  and  smaller  when  q  <  1  -p  <  p  . 

i  + 

> 

n,<* 


Table  33  illustrates  the  behavior  of  P(p  =  j  I  P>  P>  9)  for 

n,<*  1 

n=10,  j  =  l,  5,  9  and  p  =  .  5, .  9.  Observe  that  the  mode  of  the  prob¬ 
abilities  occurs  at  p  =  j  for  each  j,  p  and  q  .  Also  observe  the 
probabilities  decrease  quickly  for  p  <  j,  p  =  .  5  and  p  >  j,  p  =  .  9 
whereas  the  probabilities  decrease  much  more  slowly  for  p  >  j  , 
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p  =  .  5  and  p  <  j,  p  =  .  9.  Thus  for  example  note  that 


P(p10,.  050  =1  P-5'  P  =  1' 


.  8958 


S  P(p 


10,.  050 


=  1  p=.  5,  p  =  l  +i,  q=.  1 ) 


3.4794 


=  .  2575 


whereas 


p<o„ 


050 


=  9  p=.  5,  p=9,  q=.  1) 


.  1862 


.  8350 


?  P(p10,.  050  =  9|P-5'  ^  ^ 


.  2230 


Since  the  magnitude  of  the  modal  probability  to  E  P(p  ^  =  j  |  p. 


p=j+i,  q)  is  a  measure  of  the  goodness  of  the  estimator  and  since  in 
application  it  is  desirable  to  detect  change  points  quickly,  the  follow¬ 
ing  experimental  design  appears  to  be  optimal.  Dichotomize  the 


observations  Y, ,  •  •  • ,  Y  to  obtain  the  observations 
1  '  n 


X.  = 

v 


1  if  Y.  >  g(9) 


[-1  if  Y.  <  g(0) 


where  9  is  the  location  parameter  under  the  null  hypothesis  and  g 

is  defined  so  that  P(Y.  >  g(0)  I  0)  <  i  ;  thus  P(X.  =  1  I  p  =  . 

l —  1  Z  i  1  2  Z 

Furthermore  sample  the  Y.'s  if  possible  so  that  an  anticipated  change 
point  occurs  late  in  the  sequence. 


3.  3.  4  The  Estimator  of  q 

When  the  estimated  change  point  p  is  greater  than  or  equal  to 
the  actual  change  point  p  ,  the  last  n-£  observations  can  be  used  to 
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x. 


1+1 


of  q  .  If 


obtain  an  unbiased  estimator  q  =  — 2  - 

UfM  2 

biased  estimator  for  q  how- 


p  <  p  the  last  n-p  observations  give 
ever. 


3.4  The  Two-Sided  Testing  Situation 

In  the  last  section  it  was  noted  that  the  one-sided  test  is 
unbiased  for  p  <  ^ ,  but  that  for  p  >  ^  the  test  is  biased  for  p  small 
and  q  near  — ,  and  the  bias  increases  as  p  increases  from  — .  The 

^  L 

structure  producing  the  bias  for  the  one-sided  test  is  present  also  for 

the  two-sided  test  where  under  the  alternative  hypothesis  q  may  be 

greater  or  less  than  p  ;  this  test  is  unbiased  for  p  =  — ,  biased  for 

p  t  4  and  the  bias  increases  as  p  deviates  from  .  The  behavior 
2  2 

of  the  joint  densities  (S  ,  R  )  are  studied  for  p  =  ^  :  the  resulting 

n  n  2 

behavior  of  the  power  function  and  estimator  for  the  change  point  are 
investigated.  Lastly,  properties  of  q  are  discussed. 

3.  4.  1  The  Joint  Densities 

The  following  properties  describe  the  general  behavior  of  the 
joint  density  for  p  =  i.  Property  6  describes  the  local  behavior. 

Property  6.  For  each  n,  s,  j  and  q,  P  {(Sn,  RJ  =  (s,  j)  |  p  =  | ,  p=j,  q) 
>  p((sn»  Rn)  =  (».  j)  |p  =  \  .  P=j±l.  q)  . 
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Summing  the  probability  inequalities  over  values  s  > 
yields  the  following  global  properties. 

Property  7.  For  each  n,  s^  >  1,  i  >  0  and  q, 

Z  P{(S  ,  R  )  =  (s,  j)|  p  =  p  =  j+i,  q} 
n  n  1  2 

8  >  s« 

>  S  p  {(Sn,  Rn)  =  (s,  j)  (p  =  i-,  p  =  j+i+1,  q  } 

8  >  * '  '  C 

for  all  j  ,  and  S  P^Sn*  Rn*  =  (s*  j)|  P  =  i-i»  3  ) 

8  >S« 

>  S  P((Sn»  Rn)  =  (®»  j)  |p  =  f*  P  =  J -i -1#  q ) 
8>8<* 

‘or 

Thus  when  j  <  £ ,  the  probability  that  >  s^  and  =  j  is 

largest  when  p  =  j  and  the  probability  decreases  monotonically  as  p 
increases  or  decreases  from  j;  when  j>  and  p  <  j  this  may  not 
be  true. 

The  following  property  describes  the  manner  in  which  the  prob¬ 
abilities  decrease  as  p  deviates  from  j  . 


A 
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Property  8.  For  each  n,  s^>l,  j,  i>0  and  q, 

2  P((Sn,  Rh)  =  s,  j)[p  =  ^,  p  =  j+i,  q  } 
s>s<* 

2  p((sn-  Rn)  =  (s,  j)  |p  =  p  =  j-i,  q  >  . 
s>8* 


Property  9.  For  each  n,  s^  >  1,  q  and  j<  J^j  ,  the  tendency  is 
that 

2  P{(Sn,  Rn)  =  (s,j)|p  =  1  p  =  j,q} 
s>soc 

£  P^Sn»  Rn>  ~  |p  =  \>  P  =  n-j,  q}  . 

s>a<* 


For  fixed  p  and  varying  q  the  joint  probabilities  behave  accord¬ 
ing  to  Property  10. 


Property  10.  For  each  n, 


B„>1, 


)>  i>  0,  q^  and  where 


qi  <  q2<  P  °r  qi>q2>P' 

2  p{<Sn»  Rn>  =  (s»  j)|  p  =  \>  P  =  j+W  <51  ) 

8  >s<* 

2  p  { (sn»  Rn)  =  (8*  j)  |p  =  p  =  j+i,  q2  )  • 
8>8  ‘ 
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3.4.2  The  Power  Function 

The  power  function  P(S  >  s  I  p,  p,  q)  is  studied  as  a 

n  —  <nt  p  i 

function  of  p  and  q  .  Power  functions  for  ns  10,  20  and  p=  .5, 

.  3  and  various  values  of  p  are  found  in  Figures  34.  1,  34.  2.  For 
p  =  .  5,  =  .  047,  .  049  for  n  =  10,  20,  respectively;  for  p  =  .  3, 

<X  =  .  121,  .  095  for  n  =  10,  20,  respectively.  The  figures  illustrate 
the  following  typical  behavior  of  the  power  function  for  n  <  20.  For 
p  =  .  5  the  test  is  unbiased  and  the  power  tends  to  increase  as  |q-p| 
increases  for  each  p  and  as  p  decreases  for  q  .  Also  the  power 
tends  to  increase  as  n  increases  for  each  q  and  p  =  [cn]  ,  c  <  1. 
For  p  =  .  3  the  figure  illustrates  the  bias  of  the  test  as  q  increases 
from  p  ;  the  test  is  unbiased  for  small  and  large  alternatives  q  . 

3.  4.  3  The  Estimator  of  the  Change  Point 

Properties  7,  8  and  10  describe  the  behavior  of  the  probability 

that  S  >  s  and  R  =  j  as  functions  of  p  and  q  .  Computations 
n  —  <*  n 

indicate  that  the  same  behaviors  occur  for  p  ,  n  <  20.  That  is, 

n,o<  — 

the  mode  of  P(p  =  j  I  P  =  4- »  p,  q)  occurs  at  p  =  j  when  j  > 
and  the  probabilities  decrease  monotonically  as  p  increases  or  de¬ 
creases  from  j  ;  when  j>  the  probabilities  do  not  monotonically 
decrease  as  p  decreases  from  j  .  The  manner  in  which  the  prob¬ 
abilities  behave  for  p  =  j+i,  j-i  is  as  follows:  for  i  >  0, 

P(pn =  j  |p  =  |  .  P  =  j+i»9)  >  P(Pn>*  *  J  |p  =  P  =  J"1*  9)*  As  a 


I 
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function  in  q  ,  p(pn  ^  =  j  |  p  =  j ,  p  =  j+i,  q)  in  general  decrease 
as  q — >-p  for  i  >  0  and  increase  as  q — *p  for  i  <  0  . 

The  magnitude  of  P(p  =  j  Ip  =  ^  ,  p  =  j,  q)  relative  to 

0(  I  u 

2  P{p  =  j  I  P  =  4- ,  p  =  j+i,  q)  tends  to  be  larger  for  each  j  as 

.  n.  oi  I  2 

i 

j  p-q  j  increases;  for  no  value  of  j  is  the  relative  magnitude  largest 
however. 

Table  35  lists  P(p  ^  ^  ^  =  j  |  P  =  ^  »  p,  q)  for  c<  =  .  01 8,  .  047, 

.127,  j  =  1,5,9  and  q  =  ,1,.3  ;  the  probabilities  are  symmetric  in 

q  and  1 -q.  The  relative  magnitudes  of  P(p  =  j  p  =  ,5,  p  =  j, 

q  =  .  1)  to  2  P(p  =  j  |  P  =  .  5,  p  =  j+i,  q  =  .  1 )  for  the  same 

^  10,  £<  1 

values  of  o<  and  j  are  listed  below. 


9 

.2205 

.  3321 

.  2304 

5 

.  2534 

.  2748 

.  3045 

1 

.  3190 

.1784 

.  2754 

j 

.018 

.  047 

.  127 

Since  the  two-sided  test  is  unbiased  for  p  =  —  and  since  in 
application  it  is  desirable  to  detect  change  points  quickly,  the  same 
experimental  design  is  suggested  here  as  for  the  one-sided  test. 

3.4.4  The  Estimation  of  q 

As  for  the  one-sided  testing  situation,  when  p  >  p  the  last 
n-p  observations  may  be  used  to  obtain  an  unbiased  estimator  of  q  . 
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Table  35.  P<P1  Q>0(  =  j  j  P  =  | .  P  »  <J>. 


j 

el.  <*  =  .018 

j  =  5.  <X  =  .018 

’  9.  <*=  .018 

9 

.1109 

.  1109 

9 

.1109 

.  1109 

9 

.1109 

8 

.  1215 

.  1148 

8 

.  1215 

.1148 

8 

.  0266 

.  0830 

7 

.  1379 

.  1229 

7 

.  1379 

.  1229 

7 

.0169 

.  0697 

6 

.  1600 

.  1364 

6 

.  1600 

.  1364 

6 

.  0170 

.  0668 

5 

.  1904 

.  1556 

5 

.  1904 

.  1556 

5 

.  0212 

.  0709 

4 

.  2353 

.  1829 

4 

.  0261 

.  0806 

4 

.  0261 

.  0806 

3 

.  3077 

.  2228 

3 

.  0038 

.  0441 

3 

.  0342 

.  0967 

2 

.4445 

.  2862 

2 

.  0006 

.  0266 

2 

.  0494 

.  1232 

1 

.  8000 

.  4004 

1 

.  0001 

.  0192 

1 

.  0889 

.  1720 

p/q 

' 

.  1 

.  3 

p/ 9 

.  1 

.  3 

p/q 

. 

.  1 

.  3 

j  =  1.  <A  =  .  047  j  =  5.  <*r  .  047  j  =  9,  .X  =  .  047 


9 

.  3542 

.  3542 

9 

.  0416 

.0416 

9 

.  3542 

.  3542 

8 

.4527 

.  3807 

8 

.  062  7 

.  0472 

8 

.  1153 

.  2899 

7 

.4681 

.  3945 

7 

.  0742 

7 

.  0778 

.  2566 

6 

.  4768 

.  4080 

6 

.  0884 

.  0608 

6 

.  0690 

.  2370 

5 

.  4843 

.  4230 

5 

.  1076 

.  0699 

5 

.  0703 

.  2295 

4 

.4947 

.4409 

4 

.  0152 

.  0358 

4 

.  0753 

.  2314 

3 

.  5125 

.  4643 

3 

.  0023 

.  0190 

3 

.  0849 

.2414 

2 

.  5515 

.4972 

2 

.  0004 

.0109 

2 

.  1059 

.  2603 

1 

.  8239 

.  5832 

1 

.  0001 

.  0066 

1 

.1137 

.  2691 

p/q 

.  1 

.  3 

p/q 

.  1 

.  3 

p/q 

.  1 

.  3 

j  «  i. <*  = 

127 

j 

=  5,  0<  =  .  1  27 

asms 

9 

.  1308 

.  1308 

9 

.  1077 

.  1077 

9 

.  1308 

.  1308 

8 

.  1439 

.  1347 

8 

.  1067 

.  1074 

8 

.  0366 

.  1026 

7 

.  1668 

.  1438 

7 

.  1155 

.1105 

7 

.  0277 

.  09  35 

6 

.  2003 

.  1585 

6 

.1610 

.1285 

6 

.  0290 

.  09  20 

5 

.  2498 

.  1785 

5 

.  2405 

.  1584 

5 

.  0362 

.  o%q 

4 

.  3374 

.  2085 

4 

.  0450 

.  0927 

4 

.  0514 

.  1049 

3 

.  3929 

.  2388 

3 

.  0078 

.  0558 

3 

.  0651 

.  1242 

2 

.4326 

.  2760 

2 

.  0041 

.  0438 

2 

.  0830 

.  1445 

1 

.7809 

.  3884 

1 

.0015 

.  0378 

1 

.  1078 

.  1792 

p/q 

.  1 

.  3 

p/  q 

.  1 

.  3 

p/q 

.  1 

.  3 

CHAPTER  4 


CONCLUSION 

In  Section  1  properties  of  the  one-sided  test  statistic  S^  ,  the 

generalized  likelihood  ratio  test  statistic  C  and  Page's  test 

p,  q 

statistic  M  are  reviewed;  circumstances  indicating  preference  for 
use  of  one  of  these  statistics  over  the  others  are  outlined.  Suggestions 
for  experimental  design  using  the  test  statistics  S^  and  S^  are 
presented  in  Section  2.  Lastly  directions  for  future  research  are  dis¬ 
cussed  in  Section  3. 

4.  1  S+  or  C  or  M 

n  p,  q 

Advantages  and  appropriateness  for  use  of  one  of  the  test 
statistics  over  the  others  are  discussed. 

Of  the  three  test  statistics,  construction  of  critical  regions 
and  evaluation  of  the  statistic  for  a  sample  sequence  are  more  difficult 
for  C  .  Furthermore  q  ,  as  well  as  p  ,  must  be  known  to  perform 

p,  q  K 

the  required  calculations;  in  general  it  is  not  reasonable  to  expect 
that  q  is  known  however.  Tabulation  of  critical  regions  for  this  test 
take  up  a  great  deal  of  space.  Less  effort  is  required  to  construct 
critical  regions  and  evaluate  the  statistic  for  a  sample  sequence  for  the 
test  statistics  S*  and  M  ;  only  knowledge  of  p  is  required. 


A 
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The  types  of  sequences  in  the  critical  regions  of  the  tests  indicate 
the  appropriateness  of  the  statistics  for  the  hypothesis  being  tested. 

The  tendency  for  the  tests  based  on  C  and  M  is  to  put  those 

p»  q 

sequences  in  the  critical  region  for  which  the  estimate  of  the  change 

point  is  0  or  near  0  .  Hence  detection  of  later  changes  is  difficult 

or  impossible  for  many  values  of  p  ;  for  the  one-sided  tests  with 

p  >  q  this  tendency  becomes  more  severe  as  p  decreases.  It  appears 

that  the  tests  based  on  C  ^  and  M  more  appropriately  are  testing 

no  change  versus  a  change  either  before  or  after  sampling  began. 

On  the  other  hand,  for  whenever  the  null  hypothesis  of  no  change 

is  rejected,  each  point  pc  {l,***,n-l}  has  positive  probability  of  being 

estimated  regardless  of  the  value  of  p  ;  moreover  for  any  reasonable 

size  of  test  and  for  any  value  of  p  ,  the  change  point  cannot  be  estimated 

to  have  occurred  before  sampling  began.  The  test  based  on  S+  is  more 

appropriately  testing  no  change  versus  a  change  after  sampling  began. 

For  C  and  M  ,  rejection  of  the  null  hypothesis  is  difficult 
p»  q 

when  p  is  small;  for  example  when  n  =  5,  10,  20  and  p  =  .  1  the 

smallest  values  of  for  which  the  null  hypothesis  can  be  rejected  are 

<x=.  59,  .  35,  .  12,  respectively.  When  p  is  large,  however,  this  is  not 

a  problem;  for  n  =  5,  p  =  .9  and  q  =  .  7,  the  critical  regions  contain  26 

+  +  + 1 

of  the  32  sequences  for  =  .  08.  For  ,  P(S^  >  s  |  p)  = 

P(S*  >  s+  jl-p)  ;  the  rejection  of  the  null  hypothesis  can  be  realized 
for  a  much  larger  interval  of  p  values. 
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A  disturbing  property  of  Page's  test  is  that  x  , • • • , 

and  x  ,  ••  •  ,x,  give  the  same  value  of  M  .  His  test  is  behaving 

more  like  a  two-sided  test  than  a  one-sided  test  as  is  proposed. 

A  comparison  of  the  power  functions  for  the  tests  based  on 

S*  ,  C  and  M  indicates  that  the  pov/er  of  S  is  larger  than 
n  p,  q  n 

the  power  of  C  or  M  when  p  is  large  and  p  >  -,or  p  is 

p,  q  2 

small,  and  is  competitive  with  the  power  of  ^  or  M  when  p 

is  large  and  pe(6,  where  6  is  not  small;  the  powers  for  ^ 

and  M  are  larger  than  that  for  S*  otherwise. 

In  summary,  the  test  based  on  C  or  M  is  preferred  if 

P»  9 

one  suspects  that  a  change  is  more  likely  to  have  occurred  either 
before  sampling  began  or  shortly  thereafter  and  if  p  is  not  small. 

If  p  is  small  or  if  one  suspects  that  a  change  is  more  likely  to  have 
occurred  later  in  the  sequence,  the  test  based  on  S*  is  preferred; 
hence  for  those  situations  where  detecting  a  change  quickly  is  of  im¬ 
portance,  the  test  based  on  S*  is  more  appropriate. 

4.  2  Suggestions  for  Experimental  Design 

For  the  general  problem  where  ^F(y;  0)  for  i  =  1,  •  •  •  ,  p 
and  Y  F(y;  0’ )  for  i  =  p+1,  •  •  • ,  n  and  0  is  a  location  parameter, 
the  proposed  change  point  tests  can  be  used  to  construct  the  appropriate 
one-  or  two-sided  nonparametr ic  tests  for  detecting  a  change  in  the 
model.  The  construction  requires  definition  of  the  X  's  using  some 

i 

function  of  the  location  parameter,  say  g(0)  ,  such  that 
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I  1  if  Y.e  {  g( 0 ) r  0e  ®Q  } 

1  \-l  if  Y.  ^  { g(9)i  9e  »Q}  , 

i  =  l,»**,n  ;  the  function  g(9)  determines  the  probability 

p  =  P(X.  =  1  |  Hq)  .  The  choice  of  the  function  g(.  )  should  be 

made  to  optimize  properties  of  the  test.  Since  properties  of  the 

one-sided  test  based  on  S+  are  better  than  those  for  the  two- 

n 

sided  test  based  on  S  ,  the  one-sided  test  is  preferred:  if  one  has 

n  r 

reason  to  suspect  a  change  in  the  location  parameter  in  one  direction, 

this  information  should  be  used  to  construct  a  one-sided  test. 

Depending  on  the  situation  one  may  also  have  information  as  to  where 

an  anticipated  change  point  might  have  occurred.  This  information 

should  also  be  used  advantageously. 

Since  detection  of  change  points  quickly  is  generally  of  concern, 

it  is  suggested  that  an  anticipated  change  point  be  placed  near  the  end  of 

the  sequence  of  observations  Y  ,♦••,  Y  .  When  the  distribution 

1  n 

F  and  the  alternative  0'  is  not  known  except  that  it  is  say  less  than 
0  ,  choosing  g(0)  to  be  the  sample  median  of  the  distribution  F(x;  0) 
is  appropriate;  past  data  are  used  to  estimate  the  median. 

4.  3  Directions  for  Future  Research 

The  manners  in  which  the  test  statistics  S+  and  S  may  be 

n  p  7 

modified  or  extended  to  improve  on  this  intuitive  test  for  a  change 


point  are  many. 
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For  those  instances  in  which  a  change  in  the  model  may  have 
occurred  before  sampling  began,  one  could  modify  the  statistic  S* 
as  follows: 


sup 


1 


1_ 

n 


n 

2 

X,,  sup 

\ 

fl 

Ir 

r 

2 

1  n 

X  -  — -  2 

i  n-r  . 

i=  1 

r  =  1 ,••• 

,  n 

i=l 

i=r+l 

X. 


A  similar  modification  may  be  made  for  .  There  is  good  reason 
to  suspect  that  the  power  of  the  test  based  on  this  statistic  will  be 
greater  than  that  for  when  p  is  small.  Alternatively  when  a 
change  in  the  model  may  have  occurred  before  sampling  began,  one 
might  construct  a  test  based  on  the  union-intersection  principle 
using  S+  and  Page's  test  statistic  M. 

4 

Attempts  for  finding  approximate,  rather  than  exact,  critical 
regions  and  properties  of  the  tests  could  be  pursued  to  obtain  a 
change  point  test  for  larger  values  of  n  .  The  behavior  of  the  test 
statistics  for  n  <  ZO  suggest  that  such  attempts  are  possible. 

The  situation  where  the  X.'s  are  not  distributed  as  Bernoulli 

i 

random  variables  is  also  of  interest. 

The  behavior  of  the  test  statistics  when  the  observations  are 
dependent  could  also  be  investigated. 
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